Abstract

Some, but not all, processes of the form $M_t = \exp(-\xi_t)$ for a pure-jump subordinator $\xi$ with Laplace exponent $\Phi$ arise as residual mass processes of particle 1 (tagged particle) in exchangeable fragmentation processes. We introduce the notion of a Markovian embedding of $M$ in an exchangeable fragmentation process and show that for each $\Phi$, there is a unique binary dislocation measure $\nu$ such that $M$ has a Markovian embedding in an associated exchangeable fragmentation process. The identification of the Laplace exponent $\Phi^*$ of its tagged particle process $M^*$ gives rise to a symmetrisation operation $\Phi \mapsto \Phi^*$, which we investigate in a general study of pairs $(M, M^*)$ that coincide up to a junction time and then evolve independently. We call $M$ a fragmenter and $(M, M^*)$ a bifurcator.

For all $\Phi$ and $\alpha > 0$, we can represent a fragmenter $M$ as an interval $R_1 = [0, \int_0^\infty M_t^\alpha\,dt]$ equipped with a purely atomic probability measure $\mu_1$ capturing the jump sizes of $M_t$ after an $\alpha$-self-similar time-change. We call $(R_1, \mu_1)$ an $(\alpha, \Phi)$-string of beads. We study binary tree growth processes that in the $n$th step sample a bead from $\mu_n$ and build $(R_{n+1}, \mu_{n+1})$ by splitting the bead into a new string of beads, a rescaled independent copy of $(R_1, \mu_1)$ that we tie to the position of the sampled bead. We show that all such bead splitting processes converge almost surely to an $\alpha$-self-similar CRT, in the Gromov-Hausdorff-Prohorov sense.
1 Introduction

We call a process $M := (M_t, t \ge 0)$ a multiplicative subordinator, or fragmenter for short, if

$$M_t = \exp(-\xi_t), \quad t \ge 0,$$

for some subordinator $(\xi_t, t \ge 0)$. As shown by Pitman [25] and Bertoin [5], such processes arise naturally in the theory of continuous-time processes of coagulation and fragmentation. The process $(1 - M_t, t \ge 0)$ is the random cumulative distribution function of a random discrete probability measure on $(0, \infty)$. These random measures have been studied in the theory of Bayesian non-parametric statistics [12, 13, 22], not just for subordinators $\xi$, but more generally for increasing processes with independent increments which are not necessarily stationary.

We will use terminology based on the fragmentation interpretation of $M_t$ as the residual mass containing a particle at time $t$. Bertoin [5] showed that the mass containing particle 1 in an exchangeable homogeneous fragmentation process is a fragmenter. Let us recall the definition of a homogeneous fragmentation process (HFP). We denote by $\mathcal{P} = \mathcal{P}_{\mathbb{N}}$ the set of partitions of $\mathbb{N}$. An exchangeable HFP is a Markov process $\Pi = (\Pi(t), t \ge 0)$ in $\mathcal{P}$ such that

• given $\Pi(t) = \{B_i, i \ge 1\} \in \mathcal{P}$, the distribution of $\Pi(t+s)$ is as for the collection of blocks of $B_i \cap \Pi^{(i)}(s)$, $i \ge 1$, for a family $\Pi^{(i)}$, $i \ge 1$, of independent copies of $\Pi$, and

• the distribution of $\Pi$ is exchangeable, i.e. invariant under all permutations of $\mathbb{N}$.
It is a well-known consequence of de Finetti's theorem that exchangeable partitions have asymptotic frequencies, so $|\Pi_i(t)| = \lim_{n\to\infty} n^{-1}\#(\Pi_i(t) \cap \{1, \ldots, n\})$ exists a.s., in fact jointly for all $i \ge 1$ and $t \ge 0$, see [5]. Referring to an asymptotic frequency as mass, $M^*_t := |\Pi_1(t)|$ is the residual mass of the block $\Pi_1(t)$, which contains particle 1. Bertoin showed that for every exchangeable HFP $\Pi$, this process $M^*$ is a fragmenter. We will call $M^* := |\Pi_1|$ the canonical fragmenter of $\Pi$. As Haas [18] demonstrated, there are exchangeable HFPs with different distributions whose canonical fragmenters have the same distribution. On the other hand, in the subclass of binary models, where every infinitesimal split is into two parts (see Section 2.3), we show this cannot happen.

The starting point for this paper is the observation that not all fragmenters arise as canonical 
fragmenters in an exchangeable HFP. Furthermore, we have encountered a number of natural 
non-exchangeable models [iSl [101 ESI [HI [27j , in which masses of blocks can be defined as asymp- 
totic frequencies, and the mass containing particle 1 is also a fragmenter. Via embedding into 
an exchangeable model or via limit considerations, we have found an associated exchangeable 
model in all those cases. The purpose of this paper is to address the following converse questions, 
for a suitably general notion of "embedding": 

1. Can every fragmenter M be embedded as a residual mass process in an exchangeable HFP? 

2. For any fragmenter M embeddable in a binary HFP, is the distribution of the HFP unique? 

3. How does an embedded fragmenter M relate to the canonical fragmenter M*? 

To address these questions for strictly decreasing pure-jump fragmenters $M$ in a binary HFP $\Pi$, Section 2.1 offers a study of pairs of fragmenters $(M, \widehat{M})$ that coincide up to a junction time, after which $M$ and $\widehat{M}$ evolve independently. We call such pairs bifurcators. We study pairs where $\widehat{M}$ is obtained from $M$ via certain switching probabilities $p(u)$, $0 < u < 1$, that we also used for the alpha-theta model in [28]. We show that $M$ can then be recovered from $\widehat{M}$ by dual switching probabilities (Proposition 1) and that all bifurcators are of this form (Proposition 5). In Section 2.2 we identify a special case of size-biased switching which induces an idempotent transformation from distributions of $M$ to $M^*$. This transformation is the key to finding the exchangeable binary fragmentation model associated with $M$. In Section 2.3 we recall known facts about exchangeable fragmentations $\Pi$. In Section 2.4 we introduce the notion of a Markovian embedding of $M$ in $\Pi$, which allows us to embed every fragmenter $M$ into a binary exchangeable HFP $\Pi$ in Theorem 14, answering Question 1. Theorem 14 also claims that the distribution of $\Pi$ is unique, hence answering Question 2, but we have to postpone the proof to Section 3. The embedding is explicit, and the proof of the first part of Theorem 14 identifies the bifurcator $(M, M^*)$ from Section 2.2, hence answering Question 3. In Section 2.5 we study the three-way mass split into the parts before and after the junction time, and in Section 2.6 associated lengths induced by a bifurcator.

To complete the proof of Theorem 14 we use Haas and Miermont's [19] $\alpha$-self-similar CRTs $(\mathcal{T}, \mu)$, which are certain random rooted compact metric space trees $(\mathcal{T}, d, 0)$ equipped with a probability measure $\mu$. Specifically, a random element $\Sigma^*_1 \in \mathcal{T}$ with distribution $\mu$ yields a path $\mathcal{R}^*_1 = [[0, \Sigma^*_1]]$ in $\mathcal{T}$. For $(\mathcal{T}, \mu)$ associated with $\Pi$ (and some $\alpha > 0$), it is well-known that the process of $\mu$-masses in subtrees above points in $\mathcal{R}^*_1$ is related to a copy of $M^*$ by $\alpha$-self-similar time change. Furthermore, $\mathcal{R}^*_n = \bigcup_{i=1}^n [[0, \Sigma^*_i]]$ increases to $\mathcal{T}$ for a sample $\Sigma^*_n$, $n \ge 1$, from $\mu$. See Section 2.3 for details.

We project $\mu$ onto $\mathcal{R}^*_n$ to equip $\mathcal{R}^*_n$ with a random discrete distribution $\mu^*_n$. In particular, $(\mathcal{R}^*_1, \mu^*_1)$ is a string of beads, i.e. an interval equipped with a purely atomic measure, and $(\mathcal{R}^*_1, \mu^*_1)$ can easily be completely expressed in terms of $M^*$, see Section 3.1. Note that $\mathcal{R}^*_{n+1}$ is a tree with one more branch than $\mathcal{R}^*_n$, and $\mu^*_n$ is the projection of $\mu^*_{n+1}$ onto $\mathcal{R}^*_n$. Since $\Sigma^*_{n+1}$ is selected according to $\mu$, we have an instance of the following general notion of a bead splitting process:



Let $(R_1, \mu_1)$ be a string of beads.

Given that $R_n$ has been defined with a purely atomic probability measure $\mu_n$, pick a bead $J_n$ according to $\mu_n$. Given that $\mu_n(\{J_n\}) = m$, remove the bead from $J_n$, split it into smaller beads and tie to $J_n$ a string with these beads of total mass $m$ to form $(R_{n+1}, \mu_{n+1})$.
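The growth step just described can be sketched in code. The following is only an illustration under stated assumptions: `toy_string` is a hypothetical stand-in for the law of $(R_1, \mu_1)$ (a finite uniform stick-breaking, not derived from any fragmenter), the tree is stored as a list of branches with beads recorded as (position, mass) pairs, and the rescaling of a unit-mass string to mass $m$ (lengths multiplied by $m^\alpha$, masses by $m$) is the $\alpha$-self-similar scaling assumed here.

```python
import random

def toy_string(rng, n_beads=5):
    """Hypothetical stand-in for (R_1, mu_1): returns (length, beads) where
    beads is a list of (position, mass) pairs with masses summing to 1."""
    beads, remaining, position = [], 1.0, 0.0
    for _ in range(n_beads - 1):
        position += rng.expovariate(1.0)
        piece = remaining * rng.random()
        beads.append((position, piece))
        remaining -= piece
    position += rng.expovariate(1.0)
    beads.append((position, remaining))  # last bead sits at the end of the string
    return position, beads

def bead_splitting(n_steps, alpha, rng, sample_string=toy_string):
    """Run n_steps growth steps; each branch records its parent and attachment point."""
    length, beads = sample_string(rng)
    branches = [{"parent": None, "attach": 0.0, "length": length, "beads": list(beads)}]
    for _ in range(n_steps):
        # Pick a bead J_n according to mu_n, i.e. with probability equal to its mass.
        index = [(b, i) for b, br in enumerate(branches) for i in range(len(br["beads"]))]
        weights = [branches[b]["beads"][i][1] for b, i in index]
        b, i = rng.choices(index, weights=weights)[0]
        position, m = branches[b]["beads"].pop(i)  # remove the bead of mass m
        # Tie a rescaled independent string of total mass m to the bead's position.
        new_length, new_beads = sample_string(rng)
        branches.append({
            "parent": b,
            "attach": position,
            "length": m ** alpha * new_length,
            "beads": [(m ** alpha * x, m * w) for x, w in new_beads],
        })
    return branches
```

Each step adds exactly one branch and preserves the total mass, so after $n - 1$ steps the structure has $n$ branches and the bead masses still sum to 1.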




Figure 1: A tree equipped with strings of beads; splitting a bead into a new string of beads. 

See Figure 1 for an illustration. A similar bead splitting process, but with different bead selection rules, was obtained for the alpha-theta model of [28] by exploiting properties of the Chinese Restaurant process. In Section 3 we address the following general questions:

4. Can we give an autonomous description of the evolution of $((\mathcal{R}^*_n, \mu^*_n), n \ge 1)$ that is meaningful outside a CRT?

5. Can we generalise this to start from a string of beads associated with an arbitrary fragmenter $M$ rather than a canonical fragmenter $M^*$?

6. Does this lead to CRT convergence for every fragmenter M? 

7. What more can we say in special cases such as the Brownian CRT? 

In Theorem 21 we show that by using rescaled independent copies of $(R_1, \mu_1)$ associated with $M$ at each growth step, the bead splitting process converges almost surely for the Gromov-Hausdorff-Prohorov metric to a CRT $(\mathcal{T}, \mu)$ associated with a HFP whose canonical fragmenter is $M^*$ as identified earlier, answering Questions 4, 5 and 6. As tools, we develop a general spinal decomposition of exchangeable HFPs along a Markovian path (Lemma 22) and show a CRT convergence result based on any Markovian path $A$ (Lemma 23), which we then also use to complete the proof of Theorem 14. The transition kernel from $(R_n, \mu_n)$ to $(R_{n+1}, \mu_{n+1})$ is simple for all fragmenters and gives an inductive description of the distribution of $(R_n, \mu_n)$ for every $n \ge 1$. Section 3.3 addresses Question 7, providing direct descriptions of the distribution of $(\mathcal{R}^*_n, \mu^*_n)$ in the special case of the Brownian CRT, exploiting relations to Aldous's line-breaking construction [2], Brownian path transformations [5] and Poisson-Dirichlet distributions.



2 Fragmenters and their embedding in fragmentation processes 

This section studies a natural class of models for tracking two residual mass processes $(M_t, t \ge 0)$ and $(\widehat{M}_t, t \ge 0)$ that we can think of as parts in a fragmentation process. It is instructive to consider these models in their own right without reference to a fragmentation process.
2.1 Fragmenters, switching transformations and bifurcators

It will be assumed throughout this section that all fragmenters $M_t = \exp(-\xi_t)$, $t \ge 0$, are derived from subordinators $\xi$ with zero drift and no killing. Furthermore, for most of our discussion, we will also assume that the Lévy measure is absolutely continuous, with a Lévy density. The Lévy-Itô representation of $\xi$ is then

$$\xi_t = \sum_{0 < s \le t} \Delta\xi_s,$$

where $\{(s, \Delta\xi_s) : s > 0, \Delta\xi_s > 0\}$ is the set of points of a Poisson random measure on $(0, \infty) \times (0, \infty)$ with intensity measure $ds\,\lambda(x)dx$, where $\lambda$ is the Lévy density of the subordinator, so

$$E\left(e^{-p\xi_t}\right) = e^{-t\Phi(p)}, \quad\text{where}\quad \Phi(p) = \int_0^\infty \left(1 - e^{-px}\right)\lambda(x)\,dx, \quad p \ge 0, \tag{1}$$

is the Laplace exponent of $\xi$. The jumps of the fragmenter $M$ are described by

$$M_t = \prod_{0 < s \le t} F_s, \quad\text{with}\quad F_s = \exp(-\Delta\xi_s),$$

where $\{(s, F_s) : s > 0, F_s < 1\}$ is the set of points of a Poisson random measure on $(0, \infty) \times (0, 1)$ with intensity measure $ds\,uf(u)du$ on $(0, \infty) \times (0, 1)$, where $uf(u)du$ is the push-forward of $\lambda(x)dx$ via the transformation $u = e^{-x}$. So for all nonnegative Borel functions $g$,

$$\int_0^\infty g(e^{-x})\lambda(x)\,dx = \int_0^1 g(u)\,uf(u)\,du. \tag{2}$$

We introduce the size-biasing factor $u$ in the definition (2) of $f(u)$ to simplify applications to fragmenters associated with (binary) homogeneous fragmentations [7], which we define more formally in Section 2.3 and explain briefly in the next paragraph. We call $f$ the splitting density of the fragmenter, which is related to $\lambda$ by

$$f(u) = u^{-2}\lambda(-\log u), \quad 0 < u < 1. \tag{3}$$

By (2) for $g(u) = 1 - u^p$, the Laplace exponent of $\xi$ is then

$$\Phi(p) = \int_0^1 (1 - u^p)\,uf(u)\,du, \quad p \ge 0. \tag{4}$$

Note that $f$ is subject to the integrability condition that $\Phi(p) < \infty$ for some (hence all) $p > 0$, that is

$$\Phi(1) = \int_0^1 u(1-u)f(u)\,du < \infty. \tag{5}$$

The Lévy-Khintchine formula (1) now provides a Mellin transform for the fragmenter:

$$E(M_t^p) = e^{-t\Phi(p)}, \quad t \ge 0.$$
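As a numerical sanity check of (3) and (4), the following sketch takes the Lévy density $\lambda(x) = e^{-x}/x$ of a gamma subordinator, which is our choice of example and not one made in the text. For this density Frullani's integral gives $\Phi(p) = \log(1+p)$, and the sketch recovers this value from the splitting density via (4), using a plain midpoint rule.

```python
import math

def midpoint(g, a, b, n=200_000):
    """Midpoint-rule approximation of the integral of g over (a, b)."""
    h = (b - a) / n
    return h * sum(g(a + (k + 0.5) * h) for k in range(n))

def levy_density(x):
    """Assumed example: lambda(x) = exp(-x) / x (gamma subordinator)."""
    return math.exp(-x) / x

def splitting_density(u):
    """Formula (3): f(u) = u^{-2} lambda(-log u)."""
    return levy_density(-math.log(u)) / u ** 2

def laplace_exponent(p):
    """Formula (4): Phi(p) = int_0^1 (1 - u^p) u f(u) du."""
    return midpoint(lambda u: (1.0 - u ** p) * u * splitting_density(u), 0.0, 1.0)
```

For instance, `laplace_exponent(1.0)` should be close to `math.log(2.0)`, up to the discretisation error of the midpoint rule.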

If $(M_t, t \ge 0)$ is the mass of a randomly tagged fragment in a binary homogeneous fragmentation process with a dislocation measure $\nu_{\rm ranked}$ concentrated on decreasing non-negative sequences $s = (s_1, s_2, \ldots)$ with $s_1 + s_2 = 1$, then $M$ admits the above descriptions, assuming the existence of a density $\sum_{j=1}^{2} \nu_{\rm ranked}(s_j \in du) = f(u)du$. The size-biasing factor $u$ then arises because $f$ is necessarily symmetric, meaning $f(u) = f(1-u)$, and given a mass split $s = (u, 1-u)$ with $u \ge 1-u$ from $\nu_{\rm ranked}(ds)$, the randomly tagged fragment will be found in component $u$ with probability $u$ and in component $1-u$ with probability $1-u$, mapping a Poisson point process of mass splits $\nu_{\rm ranked}(ds)$ to a Poisson point process with intensity $uf(u)du$.

To further study fragmenters embedded in a homogeneous fragmentation process, we consider the following switching transformation of one fragmenter $M$ into another fragmenter $\widetilde{M}$. Let $p$ be a measurable function from $(0, 1)$ to $[0, 1]$. If $M$ is a fragmenter with Lévy-Itô representation $M_t = \prod_{0 < s \le t} F_s$, consider the process

$$\widetilde{M}_t := \prod_{0 < s \le t} \widetilde{F}_s \tag{6}$$

where conditionally given $M$ the factors $\widetilde{F}_s$ are defined by $\widetilde{F}_s = F_s$ with probability $1 - p(F_s)$, and $\widetilde{F}_s = 1 - F_s$ with probability $p(F_s)$. Here the construction of the point process $((s, \widetilde{F}_s), s \ge 0)$ from the point process $((s, F_s), s \ge 0)$ with intensity $ds\,uf(u)du$ is made rigorous in the usual way by some arbitrary indexing of these points by positive integers, and making independent choices for each of the countable number of $F_s$ with $F_s < 1$. Here and below it is always assumed that all processes are defined on a rich enough probability space to admit all necessary auxiliary randomizations, as are involved in passing from $((s, F_s), s \ge 0)$ to $((s, \widetilde{F}_s), s \ge 0)$. Standard transformation results for Poisson point processes imply that $((s, \widetilde{F}_s), s \ge 0)$ is a Poisson point process with intensity $ds\,u\widehat{f}(u)du$ for $\widehat{f}$ determined by the formula

$$u\widehat{f}(u) = (1 - p(u))\,uf(u) + p(1-u)(1-u)f(1-u), \quad 0 < u < 1. \tag{7}$$

In particular, provided $\int_0^1 u(1-u)\widehat{f}(u)\,du < \infty$, that is

$$\int_0^1 u^2 p(u) f(u)\,du < \infty, \tag{8}$$

the function $\widehat{f}$ serves as a splitting density, and (6) is the Lévy-Itô representation of a fragmenter $\widetilde{M}$ with splitting density $\widehat{f}$. Call $\widetilde{M}$ the fragmenter derived from $M$ by switching according to $p$. The following proposition provides a summary:

Proposition 1 If $M$ is a fragmenter with splitting density $f$, and $p$ is subject to (8), then $\widetilde{M}$ derived from $M$ by switching according to $p$ is a fragmenter with splitting density $\widehat{f}$ as in (7). Moreover, $M$ is then derived from $\widetilde{M}$ by switching according to $\widehat{p}$, where

$$\widehat{p}(u) = \frac{p(1-u)(1-u)f(1-u)}{u\widehat{f}(u)}, \quad 0 < u < 1. \tag{9}$$

This generalises [28, Lemma 19(b)], which treats the case of size-biased switching probabilities $p(u) = 1 - u$.
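The duality in Proposition 1 can be checked pointwise. In the sketch below, the splitting density `f` and the switching probability `p` are arbitrary illustrative choices (assumptions, not taken from the text); `f_hat` and `p_hat` implement (7) and (9), and `switch_back` applies (7) to the pair (`f_hat`, `p_hat`), which should return $uf(u)$.

```python
def f(u):
    """Illustrative splitting density (an assumption for this example)."""
    return 2.0 + u

def p(u):
    """Illustrative switching probability function with values in [0, 1]."""
    return 0.3 + 0.5 * u

def f_hat(u):
    """Formula (7), solved for the switched splitting density."""
    return ((1 - p(u)) * u * f(u) + p(1 - u) * (1 - u) * f(1 - u)) / u

def p_hat(u):
    """Formula (9): the dual switching probability."""
    return p(1 - u) * (1 - u) * f(1 - u) / (u * f_hat(u))

def switch_back(u):
    """Formula (7) applied to (f_hat, p_hat); Proposition 1 predicts u * f(u)."""
    return (1 - p_hat(u)) * u * f_hat(u) + p_hat(1 - u) * (1 - u) * f_hat(1 - u)
```

The check works because $(1 - \widehat{p}(u))u\widehat{f}(u) = (1 - p(u))uf(u)$ and $\widehat{p}(1-u)(1-u)\widehat{f}(1-u) = p(u)uf(u)$, which sum to $uf(u)$.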

Observe that two fragmenters $M$ and $\widetilde{M}$ as above are by construction such that $\widetilde{M}_t = M_t$ for $0 \le t < \tau$ where

$$\tau := \inf\{s : \widetilde{F}_s \ne F_s\}$$

is the time of the first switch. It is clear from the Poisson construction that $\tau$ is exponential with rate

$$\phi := \int_0^1 p(u)\,uf(u)\,du = \int_0^1 \widehat{p}(u)\,u\widehat{f}(u)\,du \in [0, \infty],$$

where we will usually exclude the trivial cases $\phi = 0$, i.e. $\tau = \infty$, and $\phi = \infty$, i.e. $\tau = 0$. The conditional distribution of $\tau$ given $M$ is made explicit by the formula

$$P(\tau > t \mid M) = \prod_{0 < s \le t} (1 - p(F_s)), \tag{10}$$

where $F_s = M_s/M_{s-}$. Assuming further that $0 < \phi < \infty$, so $P(0 < \tau < \infty) = 1$, it is clear by construction that $M$ and $\widetilde{M}$ satisfy the identification rule

$$\widetilde{M}_t = M_t \quad\text{for } 0 \le t < \tau, \tag{11}$$

hence $\widetilde{M}_{\tau-} = M_{\tau-}$, and the binary splitting condition that the decrement of each fragmenter at time $\tau$ equals the value of the other fragmenter at time $\tau$:

$$M_{\tau-} - M_\tau = \widetilde{M}_\tau \quad\text{and}\quad \widetilde{M}_{\tau-} - \widetilde{M}_\tau = M_\tau. \tag{12}$$

Call (11) and (12) together the binary junction conditions. After time $\tau$ the random factors governing the evolution of $M$ and $\widetilde{M}$ are further coupled. We now modify this construction so that the two fragmenters continue independently after time $\tau$:

Definition 2 (Bifurcator) We call a pair of fragmenters $(M, \widehat{M})$ a bifurcator with switching time $\tau$, if there are a splitting density $f$ and a switching probability function $p$ so that $\int_0^1 p(u)\,uf(u)\,du < \infty$ and $(M, \widehat{M}, \tau)$ has the following joint distribution:

• $M$ is a fragmenter with splitting density $f$,

• $\tau$ is the first switching time of an auxiliary fragmenter $\widetilde{M}$ derived from $M$ by switching according to $p$,

• $\widehat{M}_t = M_t$, $t < \tau$, $\widehat{M}_\tau = M_{\tau-} - M_\tau$ and $(\widehat{M}_{\tau+t}/\widehat{M}_\tau, t \ge 0)$ is a copy of $\widetilde{M}$ independent of $(M, \tau)$.

See Propositions 4 and 5 for characterisations that may serve as alternative definitions.

Note that in our construction, $\widehat{M} = \widetilde{M}$ on $[0, \tau]$, so the binary junction conditions (11) and (12) hold just as well for $\widehat{M}$ as for $\widetilde{M}$. But after time $\tau$ the evolutions of $M$ and $\widehat{M}$ are decoupled. Dually, $(M_{\tau+t}/M_\tau, t \ge 0)$ is a copy of $M$ which is independent of $(\widehat{M}, \tau)$.
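To make the construction concrete, here is a minimal simulation sketch in the simplest finite-rate case. It assumes (our choice, for illustration only) that the jump factors of $M$ have intensity $ds\,c\,du$ on $(0, \infty) \times (0, 1)$, i.e. $uf(u) = c$, so that jumps arrive at rate $c$ with uniform factors. The function names and the path representation as lists of (time, value) pairs are hypothetical.

```python
import random

def jump_factors(c, horizon, rng):
    """Atoms (s, F_s) of a Poisson point process with intensity c ds du
    on (0, horizon] x (0, 1)."""
    atoms, s = [], rng.expovariate(c)
    while s <= horizon:
        atoms.append((s, rng.random()))
        s += rng.expovariate(c)
    return atoms

def bifurcator(c, p, horizon, rng):
    """Simulate (M, M_hat, tau) on [0, horizon]; tau is None if no switch occurs."""
    M_path, Mhat_path, tau = [(0.0, 1.0)], [(0.0, 1.0)], None
    for s, F in jump_factors(c, horizon, rng):
        m_before = M_path[-1][1]
        M_path.append((s, m_before * F))
        if tau is None:
            if rng.random() < p(F):
                # First switch: M_hat takes the complementary part of the split.
                tau = s
                Mhat_path.append((s, m_before * (1.0 - F)))
            else:
                # Before the switching time the two paths coincide.
                Mhat_path.append((s, m_before * F))
    if tau is not None:
        # After tau, M_hat follows an independent copy of the switched fragmenter.
        for s, F in jump_factors(c, horizon - tau, rng):
            G = 1.0 - F if rng.random() < p(F) else F
            Mhat_path.append((tau + s, Mhat_path[-1][1] * G))
    return M_path, Mhat_path, tau
```

With $c = 2$ and $p(u) = 1 - u$, the switching rate is $\phi = c\int_0^1 (1-u)\,du = 1$, so the simulated switching times should be approximately standard exponential, and at the switching time the two values add up to the common pre-jump value, as required by (12).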

Henceforth we will no longer be concerned with any $\widetilde{M}$ that is further coupled with $M$ after $\tau$, and we will instead use the generic notation $(M, \widehat{M})$ for a bifurcator. Then for some splitting time $\tau$, whose joint law with $M$ is determined by the switching probability function $p$,

$$\widehat{M}_t = M_t 1_{\{\tau > t\}} + (M_{\tau-} - M_\tau)\,\widehat{M}'_{t-\tau} 1_{\{\tau \le t\}} \tag{13}$$

where $\widehat{M}' \overset{d}{=} \widehat{M}$ with $\widehat{M}'$ independent of $(M, \tau)$. Note the subtlety that $\widehat{M}$ is determined pathwise by $M$ up to and including the splitting time $\tau$, but thereafter the jumps of $M$ and $\widehat{M}$ are decoupled: the distribution of how $\widehat{M}$ evolves after time $\tau$ is implicitly determined by $M$ and $p$, but there is no pathwise coupling between $M$ and $\widehat{M}$ after time $\tau$. Rather, $(M, \widehat{M})$ satisfies the following property.

Definition 3 (Asymmetric Markov branching property) We say that $(M, \widehat{M})$ has the asymmetric Markov branching property relative to the splitting time $\tau$ if

• conditionally given $\tau > t$ the process $((M_{t+v}/M_t, \widehat{M}_{t+v}/\widehat{M}_t), v \ge 0)$ is a copy of $(M, \widehat{M})$, independent of $((M_s, \widehat{M}_s), 0 \le s \le t)$,

• conditionally given $\tau \le t$ the two processes $(M_{t+v}/M_t, v \ge 0)$ and $(\widehat{M}_{t+v}/\widehat{M}_t, v \ge 0)$ are independent copies of $M$ and $\widehat{M}$ respectively, independent of $((M_s, \widehat{M}_s), 0 \le s \le t)$.

The following variation of Proposition 1 follows easily from that proposition, standard facts about Poisson point processes, and the above definition:

Proposition 4 The joint distribution of the bifurcator $(M, \widehat{M})$ is uniquely determined by the splitting density $f$ of $M$ and a switching probability function $p$ subject to

$$0 < \phi := \int_0^1 p(u)\,uf(u)\,du < \infty \tag{14}$$

or dually by the splitting density $\widehat{f}$ of $\widehat{M}$ and the dual switching probability function $\widehat{p}$, subject to $0 < \widehat{\phi} < \infty$, as specified in (7) and (9). Then $\phi = \widehat{\phi}$ is the rate of the exponentially distributed junction time $\tau$. Such a bifurcator $(M, \widehat{M})$ can also be constructed as follows from five independent ingredients: three fragmenters $M^0$, $M'$ and $\widehat{M}'$ with splitting densities $(1 - p(u))f(u)$, $f(u)$ and $\widehat{f}(u)$ respectively, an exponential time $\tau$ with rate $\phi$, and a random variable $U \in (0, 1)$ with distribution

$$P(U \in du) = \phi^{-1} p(u)\,uf(u)\,du, \quad 0 < u < 1. \tag{15}$$

Now define $(M, \widehat{M})$ by $M_t = \widehat{M}_t = M^0_t$ for $t < \tau$, and let

$$M_{\tau+v} = U M^0_\tau M'_v \quad\text{and}\quad \widehat{M}_{\tau+v} = (1 - U) M^0_\tau \widehat{M}'_v \quad\text{for } v \ge 0. \tag{16}$$



We assumed for ease of exposition that $M$ has a splitting density $f$. However, the operation of switching according to $p$ and the notion of an associated bifurcator are meaningful when we replace $uf(u)du$ by a more general measure $\Lambda(du)$ satisfying $\int_{(0,1)} (1 - u)\Lambda(du) < \infty$, leading to dual $\widehat{\Lambda}(du)$ and $\widehat{p}(u)$ that are uniquely identified (in the case of $\widehat{\Lambda}$, and for $\widehat{\Lambda}$-a.e. $u \in (0, 1)$ in the case of $\widehat{p}(u)$) by generalisations of (7) and (9):

$$\widehat{\Lambda}(du) = (1 - p(u))\Lambda(du) + p(1-u)\overline{\Lambda}(du) \quad\text{and}\quad \widehat{p}(u)\widehat{\Lambda}(du) = p(1-u)\overline{\Lambda}(du), \tag{17}$$

where $\overline{\Lambda}$ is the image measure of $\Lambda$ under the switching operation $u \mapsto 1 - u$. With this generalisation, note that $\tau = \inf\{s \ge 0 : \widetilde{F}_s \ne F_s\}$ can fail when $p(1/2)\Lambda(\{1/2\}) > 0$, and that $\tau = \inf\{s \ge 0 : \widehat{M}_s \ne M_s\}$ can still fail for a bifurcator $(M, \widehat{M})$, but only when furthermore $\Lambda$ is finite. One way to include those cases is to slightly re-model the switching transformation by marking $(F_s, s \ge 0)$ by a marking kernel $K : (0, 1) \to \{0, 1\}$, where $K(u, 1) = p(u)$ and $K(u, 0) = 1 - p(u)$, with associated marked point process $((F_s, m_s), s \ge 0)$ mapping to $\widetilde{F}_s = (1 - m_s)F_s + m_s(1 - F_s)$, and with $\tau = \inf\{s \ge 0 : m_s = 1\}$. In the following characterisation of bifurcators, it is more natural to exclude the cases when $\tau \ne \inf\{s \ge 0 : \widehat{M}_s \ne M_s\}$. Note that for those cases the analogue of (12) at $\inf\{s \ge 0 : \widehat{M}_s \ne M_s\}$ fails since this is the first of the jump times after $\tau$, and the respective first jump times of $M$ and $\widehat{M}$ after $\tau$ will be different a.s.

Proposition 5 Consider a pair of positive non-increasing pure jump processes $(M, \widehat{M})$ and suppose that $\tau = \inf\{t \ge 0 : \widehat{M}_t \ne M_t\} \in (0, \infty)$ a.s. Then $(M, \widehat{M})$ is a bifurcator with splitting time $\tau$ if and only if the asymmetric Markov branching property of Definition 3 holds relative to $\tau$, together with the binary junction conditions (11) and (12).

Proof. The only if part is straightforward. For the if part suppose that $(M, \widehat{M})$ satisfies the asymmetric Markov branching property. Then Definition 3 implies that $(M_t, \widehat{M}_t, 1_{\{\tau > t\}})$, $t \ge 0$, is a Markov process in its natural filtration. Furthermore, each component is Markovian in its own right with respect to this filtration. More specifically, we see that $M$ and $\widehat{M}$ are fragmenters with some Lévy measures $\Lambda$ and $\widehat{\Lambda}$, and that $\tau$ is exponentially distributed with some rate $\phi$. From (12), we have that $\tau$ is a common jump time of all three components, so we can consider $\tau$ as a marking time for each of the Poisson point processes $((s, F_s), s \ge 0)$ and $((s, \widehat{F}_s), s \ge 0)$, with mark 1 at $\tau$, say. By Lemma 6 below, this yields marking kernels $K, \widehat{K} : (0, 1) \to \{0, 1\}$, from which we define $p(u) = K(u, \{1\})$ and $\widehat{p}(u) = \widehat{K}(u, \{1\})$. By standard results for marking and thinning Poisson point processes, we find

$$P(F_\tau \in du) = \phi^{-1} p(u)\Lambda(du) \quad\text{and}\quad P(\widehat{F}_\tau \in du) = \phi^{-1}\widehat{p}(u)\widehat{\Lambda}(du),$$

and the points before $\tau$, which by (11) are common to both processes, have equal thinned intensity measures

$$(1 - p(u))\Lambda(du) = (1 - \widehat{p}(u))\widehat{\Lambda}(du).$$

Together with (12), these equations are equivalent to (17), and we easily deduce that $(M, \widehat{M})$ is indeed a bifurcator with splitting time $\tau$ in the sense of Definition 2. □

Lemma 6 Consider a filtration $\mathcal{F}$, an $\mathcal{F}$-Poisson point process $(F_t, t \ge 0)$ with intensity measure $\Lambda$ on $(0, 1)$ and cemetery 1, an $\mathcal{F}$-stopping time $\tau$ such that $F_\tau \ne 1$ a.s. and such that conditionally given $\tau > t$, we have $(\tau - t, (F_{t+s}, s \ge 0)) \overset{d}{=} (\tau, (F_s, s \ge 0))$, for all $t \ge 0$. Then there exists a marking kernel $K : (0, 1) \to \{0, 1\}$ such that for a Poisson point process $((\widetilde{F}_t, \widetilde{m}_t), t \ge 0)$ with intensity measure $\Lambda^K(du, dm) = K(u, dm)\Lambda(du)$ and for $\widetilde{\tau} = \inf\{t \ge 0 : \widetilde{m}_t = 1\}$, we have $((\widetilde{F}_t, t \ge 0), \widetilde{\tau}) \overset{d}{=} ((F_t, t \ge 0), \tau)$.

We prove Lemma 6 in the Appendix.



Remark 7 While a splitting density $f$ and a switching probability function $p$ together uniquely identify the distribution of a bifurcator, for two given splitting densities $f$ and $\widehat{f}$, there may not be an associated bifurcator $(M, \widehat{M})$. Looking ahead at Theorem 14, this will, in fact, be the typical case. On the other hand, for fragmenters $M$ and $\widehat{M}$ that can be coupled to form a bifurcator, there are typically many other couplings as different bifurcators. This can be seen from (7), which for each $u$ leaves some choice of $p(u)$ and $p(1 - u)$. We will see in Remark 15 that for any choice with both $p(u)$ and $p(1 - u)$ in $[0, 1]$, the equation (7) for $1 - u$ instead of $u$, which appears to give a second equation relating $p(u)$ and $p(1 - u)$, will automatically hold if (and only if) the two fragmenters can be embedded in the same fragmentation process.

Example 8 An extreme example of a switching probability function is $p(u) = 1$ for $u < 1/2$ and $p(u) = 0$ for $u \ge 1/2$, in words: switch if the other block is bigger. We then obtain from (7) that $u\widehat{f}(u) = uf(u) + (1 - u)f(1 - u)$ for $u \ge 1/2$ and $u\widehat{f}(u) = 0$ for $u < 1/2$. Note that in the context of Remark 7, there is only one bifurcator $(M, \widehat{M})$ which has a given $f$ and this associated $\widehat{f}$ as splitting densities.

2.2 Size-biased branching 

The instance of the bifurcator construction of the previous section with

$$p(u) = 1 - u, \quad 0 < u < 1,$$

is of special interest. We then say that the bifurcator $(M, \widehat{M})$ is derived from $M$ by size-biased branching, and use the notation $(M, M^*)$ instead of $(M, \widehat{M})$ to indicate this special construction. Note that the "size" involved in the size-biasing is the size of the residual factor $1 - u$ associated with decrements of $M$ by a factor of $u$, that is the size relative to the current value of $M$ of the fragment that splits.

We note the following corollaries to the results obtained in the previous section. 

Corollary 9 If $M$ is a fragmenter with splitting density $f$ and Laplace exponent

$$\Phi(p) = \int_0^1 (1 - u^p)\,uf(u)\,du$$

then $M^*$ derived from $M$ by size-biased branching is a fragmenter with splitting density

$$f^*(u) = uf(u) + (1 - u)f(1 - u), \quad 0 < u < 1, \tag{18}$$

and Laplace exponent

$$\Phi^*(p) = \int_0^1 (1 - u^p)\,uf^*(u)\,du = \Phi(p + 1) - \Phi(p + 1, p + 1), \quad p \ge 0, \tag{19}$$

where $\Phi(p + 1, p + 1)$ is given by

$$\Phi(p + 1, p + 1) = \int_0^1 (1 - u)^{p+1}\,uf(u)\,du.$$

Moreover, $M$ is then derived from $M^*$ by switching according to $p^*$, where

$$p^*(u) = \frac{(1 - u)f(1 - u)}{f^*(u)}, \quad 0 < u < 1. \tag{20}$$
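The identities in Corollary 9 lend themselves to a quick numerical check. In the sketch below the non-symmetric density `f` is an arbitrary illustrative choice (an assumption), and integrals over $(0, 1)$ are approximated by a midpoint rule; `Phi_two` is our name for $\Phi(p+1, p+1)$.

```python
def symmetrise(f):
    """Formula (18): f*(u) = u f(u) + (1 - u) f(1 - u)."""
    return lambda u: u * f(u) + (1.0 - u) * f(1.0 - u)

def integrate(g, n=20_000):
    """Midpoint-rule approximation of the integral of g over (0, 1)."""
    h = 1.0 / n
    return h * sum(g((k + 0.5) * h) for k in range(n))

def Phi(f, p):
    """Laplace exponent Phi(p) = int_0^1 (1 - u^p) u f(u) du."""
    return integrate(lambda u: (1.0 - u ** p) * u * f(u))

def Phi_two(f, p):
    """Phi(p + 1, p + 1) = int_0^1 (1 - u)^(p + 1) u f(u) du."""
    return integrate(lambda u: (1.0 - u) ** (p + 1) * u * f(u))

f = lambda u: 1.0 + 3.0 * u * u   # illustrative non-symmetric splitting density
f_star = symmetrise(f)
```

Here `f_star` is symmetric, applying `symmetrise` a second time leaves it unchanged, and `Phi(f_star, p)` agrees with `Phi(f, p + 1) - Phi_two(f, p)` as in (19), since $(1 - u^p)u + (1 - (1-u)^p)(1-u) = 1 - u^{p+1} - (1-u)^{p+1}$.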



Corollary 10 In the setting of the previous corollary, the following conditions are equivalent:

(i) $f$ is symmetric: $f(u) = f(1 - u)$ for all $0 < u < 1$;

(ii) $f = f^*$;

(iii) $\Phi = \Phi^*$;

(iv) $p^*(u) = 1 - u$ for all $0 < u < 1$;

(v) $M \overset{d}{=} M^*$;

(vi) $(M, M^*) \overset{d}{=} (M^*, M)$.

Observe from (18) that whatever the splitting density $f$ of $M$, the splitting density $f^*$ of $M^*$ is symmetric. Thus the operation of passing from the law of $M$ to the law of $M^*$ by size-biased branching is a kind of symmetrisation of laws of fragmenters corresponding to the elementary symmetrisation of density functions defined by formula (18). The operation is idempotent: $M^{**} \overset{d}{=} M^*$. So we make the following definition:

Definition 11 (Symmetrised fragmenter) For a fragmenter $M$ with splitting density $f$ call the fragmenter $M^*$ with splitting density $f^*$ as in (18) the symmetrisation of $M$.

This notion of size-biased branching and symmetrisation can clearly be extended to fragmenters whose splitting measures do not have a density, as is achieved by the formula (19) for Laplace exponents. Note however that the probabilistic meaning in terms of size-biased branching, and even the analytic fact that $\Phi^{**} = \Phi^*$, is very much obscured by the Laplace exponents.

As can be seen in the symmetry discussion leading up to (21) below, this operation of symmetrisation of $M$ projects the collection of laws of all fragmenters $M$ onto the collection of laws of fragmenters $M^*$ which are canonically associated with a binary homogeneous fragmentation process via the mass of a uniformly randomly tagged fragment. This raises the question: exactly how is a fragmenter $M$ with splitting density $f$ related to the binary homogeneous fragmentation process with splitting density $f^*$? We answer this question in Section 2.4, after development of the necessary framework in Section 2.3.



2.3 Exchangeable fragmentation processes and self-similar CRTs 

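In our context, we can express Bertoin's [5, 6] definitions of homogeneous and self-similar fragmentations as follows. For $\alpha \in \mathbb{R}$, we say that a family $\Pi_\alpha = (\Pi_\alpha(t), t \ge 0)$ of refining partitions in $\mathcal{P} = \mathcal{P}_{\mathbb{N}}$ is an exchangeable $\alpha$-self-similar fragmentation process if both

• $\Pi_\alpha$ is exchangeable in that the distribution of $\Pi_\alpha$ is invariant under permutations of $\mathbb{N}$,

• $\Pi_\alpha$ is a right-continuous strong Markov process whose transition kernel satisfies the branching property: for all $t \ge 0$, $s \ge 0$, conditionally given $\Pi_\alpha(t) = \{B_i, i \ge 1\}$, the partition $\Pi_\alpha(t + s)$ has the same distribution as the partition of $\mathbb{N}$ with blocks $B_i \cap \Pi^{(i)}_\alpha(|B_i|^{-\alpha}s)$, $i \ge 1$, where the $\Pi^{(i)}_\alpha$, $i \ge 1$, are independent copies of $\Pi_\alpha$.

Usually we consider $\Pi_\alpha(0) = \{\mathbb{N}\}$, and we exclude the trivial case $\Pi_\alpha(t) = \{\mathbb{N}\}$ for all $t \ge 0$. Then $\Pi_\alpha(\infty) = \{\{1\}, \{2\}, \ldots\}$. In the case $\alpha = 0$, the linear time-changes of $\Pi^{(i)}_\alpha$ by asymptotic frequencies $|B_i|^{-\alpha}$ disappear; this case is called an exchangeable homogeneous fragmentation process. Bertoin [5] showed that the distribution of $\Pi = \Pi_0$ can be expressed in terms of an exchangeable $\sigma$-finite intensity measure $\kappa(d\Gamma)$ on $\mathcal{P} \setminus \{\{\mathbb{N}\}\}$ via a Lévy-Itô-type decomposition into elementary splits of blocks $B$ into $B \cap \Gamma$. The measure $\kappa$ admits an integral representation

$$\kappa(d\Gamma) = c \sum_{n \ge 1} \delta_{\{\mathbb{N} \setminus \{n\}, \{n\}\}}(d\Gamma) + \int_{\mathcal{S}^\downarrow} \kappa_s(d\Gamma)\,\nu_{\rm ranked}(ds),$$

for an erosion coefficient $c \ge 0$ and a ranked dislocation measure $\nu_{\rm ranked}$ on $\mathcal{S}^\downarrow \setminus \{(1, 0, \ldots)\}$ satisfying $\int_{\mathcal{S}^\downarrow} (1 - s_1)\,\nu_{\rm ranked}(ds) < \infty$, and where $\kappa_s$ is Kingman's paintbox governing exchangeable partitions with asymptotic frequencies $s \in \mathcal{S}^\downarrow := \{(s_j)_{j \ge 1} : s_1 \ge s_2 \ge \cdots \ge 0, \sum_{j \ge 1} s_j \le 1\}$. In the binary case $\kappa(\Gamma \in \mathcal{P} \setminus \{\{\mathbb{N}\}\} : \Gamma_1 \cup \Gamma_2 \ne \mathbb{N}) = 0$, this representation can be written as

$$\kappa(d\Gamma) = c \sum_{n \ge 1} \delta_{\{\mathbb{N} \setminus \{n\}, \{n\}\}}(d\Gamma) + \frac{1}{2}\int_{(0,1)} \kappa_{(u, 1-u)}(d\Gamma)\,\nu(du)$$

for a symmetric dislocation measure $\nu$ on $(0, 1)$ satisfying $\int_{(0,1)} s(1 - s)\,\nu(ds) < \infty$ and $\nu = \overline{\nu}$, where $\overline{\nu}$ is the push-forward of $\nu$ under $u \mapsto 1 - u$, so $\nu(du) = \nu_{\rm ranked}(s_1 \in du) + \nu_{\rm ranked}(s_2 \in du)$ and $\nu_{\rm ranked}(s_1 \in \cdot\,) = \nu(\,\cdot \cap (\tfrac12, 1)) + \tfrac12\nu(\,\cdot \cap \{\tfrac12\}) = \overline{\nu}(\,\cdot \cap (\tfrac12, 1)) + \tfrac12\overline{\nu}(\,\cdot \cap \{\tfrac12\})$.

We denote by $|A^n_\alpha(t)|$ the asymptotic frequency of the block $A^n_\alpha(t)$ of $\Pi_\alpha(t)$ containing $n$. For $\alpha = 0$, the process $|A^n_0(t)|$ is a fragmenter, and $\xi_n(t) = -\log|A^n_0(t)|$ has Laplace exponent

$$\Phi(p) = c + cp + \int_{(0,1)} (1 - u^p)\,u\,\nu(du), \tag{21}$$

see [5]. Self-similar and homogeneous fragmentation processes are pathwise related by non-linear time-change [6]. Specifically,

$$|A^n_\alpha(t)| = \exp\left(-\xi_n(\eta_n(t))\right), \quad\text{where}\quad \eta_n(t) = \inf\left\{u \ge 0 : \int_0^u e^{-\alpha\xi_n(w)}\,dw > t\right\}, \tag{22}$$

is a self-similar Markov process, and for $\alpha > 0$, we have $|A^n_\alpha(t)| = 0$ for $t \ge \int_0^\infty e^{-\alpha\xi_n(w)}\,dw$.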

It was shown by Haas and Miermont [19] that for every exchangeable self-similar fragmentation process $\Pi_\alpha$ with index $\alpha > 0$, zero erosion $c = 0$ and infinite dislocation measure $\nu_{\rm ranked}$ on $\mathcal{S}^\downarrow_1 = \{s \in \mathcal{S}^\downarrow : \sum_{i \ge 1} s_i = 1\}$, there is an associated compact continuum random tree $(\mathcal{T}, d, 0, \mu)$. Vice versa, such a continuum random tree (CRT) allows an embedding of a self-similar fragmentation process. Specifically, a CRT is a random weighted and rooted $\mathbb{R}$-tree. A weighted and rooted $\mathbb{R}$-tree $(\mathcal{T}, d, 0, \mu)$ is a complete, separable metric space $(\mathcal{T}, d)$ with a root $0 \in \mathcal{T}$ and a probability measure $\mu$ on the Borel sets of $(\mathcal{T}, d)$, such that the following tree property holds:

• Any two points $\sigma, \sigma' \in \mathcal{T}$ are connected by a unique injective path $[[\sigma, \sigma']]$. Furthermore, this path can be parametrised $[[\sigma, \sigma']] = \{g_{\sigma,\sigma'}(t), 0 \le t \le d(\sigma, \sigma')\}$ by an isometry $g_{\sigma,\sigma'} : [0, d(\sigma, \sigma')] \to \mathcal{T}$ with $g_{\sigma,\sigma'}(0) = \sigma$ and $g_{\sigma,\sigma'}(d(\sigma, \sigma')) = \sigma'$.

When there is no ambiguity about $d$, or even $\mu$, we simply write $(\mathcal{T}, \mu)$ or even $\mathcal{T}$ to refer to a CRT $(\mathcal{T}, d, 0, \mu)$. For the purpose of convergence of compact weighted and rooted $\mathbb{R}$-trees, we will identify $(\mathcal{T}, d, 0, \mu)$ and $(\mathcal{T}', d', 0', \mu')$ if there is an isometry between $(\mathcal{T}, d)$ and $(\mathcal{T}', d')$ that maps $0$ to $0'$ and pushes $\mu$ forward to $\mu'$. The set $\mathbb{T}$ of such isometry classes can then be equipped with the so-called Gromov-Hausdorff-Prohorov distance $d_{\rm GHP}$. Then $(\mathbb{T}, d_{\rm GHP})$ is a Polish metric space.

For an $\mathbb{R}$-tree $(\mathcal{T}, d, 0, \mu)$, let $\mathcal{T}^t := \{\sigma \in \mathcal{T} : d(0, \sigma) > t\}$, $t \ge 0$, and define fringe subtrees $\mathcal{T}_\sigma := \{\sigma' \in \mathcal{T} : \sigma \in [[0, \sigma']]\}$, $\sigma \in \mathcal{T}$. An $\alpha$-self-similar CRT is a random weighted and rooted $\mathbb{R}$-tree $(\mathcal{T}, d, 0, \mu)$, or its isometry class with distribution on the Borel space of $(\mathbb{T}, d_{\rm GHP})$, such that

• $\mu$ is non-atomic with dense support a.s., $\mu(\mathcal{T}_\sigma) > 0$ for all $\sigma \in \mathcal{T}$ with $\mathcal{T}_\sigma \ne \{\sigma\}$, while $\mu([[0, \sigma]]) = 0$ for all $\sigma \in \mathcal{T}$, and

• for all $t \ge 0$, the connected components $(\mathcal{T}^t_i, i \ge 1)$ of $\mathcal{T}^t$, completed by a root vertex $0_i$, are such that given $(\mu(\mathcal{T}^t_i), i \ge 1) = (m_i, i \ge 1)$, for some $m_1 \ge m_2 \ge \cdots \ge 0$, the trees

$$\left(\mathcal{T}^t_i,\; m_i^{-\alpha}\,d|_{\mathcal{T}^t_i},\; 0_i,\; m_i^{-1}\mu|_{\mathcal{T}^t_i}\right), \quad i \ge 1,$$

are independent and identically distributed isometric copies of $(\mathcal{T}, d, 0, \mu)$.
Recently, Stephenson [29] extended this class by relaxing the first bullet point to allow a support that is not dense, atoms of $\mu$, and/or positive weights on branches, so as to include all dislocation measures $\nu_{\rm ranked}$ and erosion $c \ge 0$. The CRT $(\mathcal{T}, \mu)$ constructed in [19, 29] is such that

$$\Pi'_\alpha(t) = \left\{\{j \ge 1 : \Sigma^*_j \in \mathcal{T}^t_i\}, i \ge 1\right\} \cup \left\{\{j\}, j \ge 1 : \Sigma^*_j \notin \mathcal{T}^t\right\}, \quad t \ge 0, \tag{23}$$

has the same distribution as $(\Pi_\alpha(t), t \ge 0)$, where conditionally given $(\mathcal{T}, \mu)$ the sequence $\Sigma^*_n$, $n \ge 1$, is independent and identically distributed according to $\mu$. It was shown in [19, 29] that the subtrees $\mathcal{R}^*_k = \bigcup_{j=1}^k [[0, \Sigma^*_j]] \subset \mathcal{T}$ converge a.s. in the Hausdorff sense for embeddings in $\ell_1(\mathbb{N})$, and this easily entails $d_{\rm GHP}((\mathcal{R}^*_k, \mu^*_k), (\mathcal{T}, \mu)) \to 0$ a.s., as $k \to \infty$, where $\mu^*_k$ is the push-forward of $\mu$ under the projection map $\pi^*_k : \mathcal{T} \to \mathcal{R}^*_k$, $\sigma \mapsto g_{0,\sigma}(\sup\{t \ge 0 : g_{0,\sigma}(t) \in \mathcal{R}^*_k\})$. Also, $\mu$ is then recovered in accordance with Aldous's theory of consistent leaf-exchangeable families of trees $(\mathcal{R}^*_k, k \ge 1)$ as the weak limit of the uniform distribution on $\Sigma^*_1, \ldots, \Sigma^*_k$, as $k \to \infty$.

2.4 Embedding fragmenters in homogeneous binary fragmentation processes 

Let $\Pi = (\Pi(t), t \ge 0)$ be a binary homogeneous fragmentation process starting from $\Pi(0) = \{\mathbb{N}\}$,
with dislocation measure $\nu(du) = f^*(u)\,du$ for some symmetric splitting density $f^*$ on $(0, 1)$, so
$\Pi$ takes values in the set of partitions of $\mathbb{N}$. Let $A = (A_t, t \ge 0)$ be a process with values in
subsets of $\mathbb{N}$. Call $A$ a path in $\Pi$ if

• $A_t \in \Pi(t)$ for all $t \ge 0$;

• $A_t$ is decreasing as $t$ increases.

Definition 12 (Markovian path) We call a path $A$ a Markovian path in $\Pi$ if

• $(A_t, t \ge 0)$ is adapted to some filtration $(\mathcal{F}_t, t \ge 0)$ with respect to which $(\Pi(t), t \ge 0)$ is
Markovian, in such a way that for each $s \ge 0$ the process $((A_t, \Pi(t) \cap A_s), t \ge s)$ and the
restriction of $(\Pi(t), t \ge s)$ to $\mathbb{N} \setminus A_s$ are conditionally independent given $\mathcal{F}_s$.

To explain the terminology, think of $\Pi$ as embedded by suitable time change in the $\alpha$-self-similar
continuum random tree (CRT) associated with $\Pi$ for $\alpha > 0$ by Haas and Miermont [19],
or by Stephenson [29] when $\nu$ is finite. Then $A_t$ represents the set of leaf labels above some
internal vertex $v_t$ of the CRT in some path leading from the root to a leaf vertex $v_\infty$ of the tree.

Let $M$ be a fragmenter. Say that $M$ admits a Markovian embedding in $\Pi$ if it is possible to
construct $\Pi$ jointly with a Markovian path $A$ such that

$$(M_t, t \ge 0) = (|A_t|, t \ge 0)$$

where $|A_t|$ is the asymptotic frequency of $A_t$, which is known to exist almost surely, simultaneously
for all $t \ge 0$ and all sets $A_t \in \Pi(t)$. In terms of an associated CRT construction, the
jumps of $|A|$ would then describe the spinal partition of mass in the CRT along a spine leading
from the root to some random leaf of the CRT. The most basic example is provided by the next
proposition:

Proposition 13 For each positive integer $n$ let $A^n_t$ be the block of $\Pi(t)$ containing $n$. Then

(i) (Bertoin [5]) $A^n$ is a Markovian path in $\Pi$ such that $|A^n|$ is a fragmenter with splitting
density $f^*$.

(ii) For each pair of positive integers $m$ and $n$, the pair $(|A^n|, |A^m|)$ is a symmetric bifurcator,
each derived by size-biased branching from the other.

Proof. Part (ii) follows easily from the (strong) homogeneous branching property of $\Pi$ and
Corollary 3. $\square$

Due to the natural embeddings provided by this proposition, we call the fragmenter $M^*$ with
splitting density $f^*$ the canonical fragmenter associated with $\Pi$. In terms of the corresponding
mass fragmentation $(|\Pi(t)|^\downarrow, t \ge 0)$ of asymptotic frequencies $|\Pi_i(t)|$, $i \ge 1$, ranked into decreasing
order $|\Pi(t)|^\downarrow \in \mathcal{S}^\downarrow$, the process $M^*$ describes the evolution of the mass of the fragment
containing a randomly tagged particle.

Theorem 14 A fragmenter $M$ with splitting density $f$ admits a Markovian embedding in a
binary homogeneous fragmentation process $\Pi$ if and only if the splitting density of $\Pi$ is the
symmetrisation $f^*$ of the splitting density $f$ of $M$, as in (18). The canonical fragmenter of $\Pi$ is
then the symmetrisation of $M$.

Remark 15 Consider any bifurcator $(M, \widehat{M})$. By adding equation (7) and the equation we
obtain by substituting $u$ by $1 - u$ in (7), we see that $f$ and $\widehat{f}$ have the same symmetrisation $f^*$.
By Theorem 14, $M$ and $\widehat{M}$ can each be embedded in the same binary homogeneous fragmentation
process $\Pi$. In fact, the argument used to prove this theorem can be adapted to prove that the
bifurcator admits an embedding in $\Pi$. We leave the details of this to the reader.

To prepare for the proof of the theorem, we start with some remarks about paths $A$ in $\Pi$.
For $t \ge 0$ let $N_t := \min A_t$. Clearly, $N_0 = 1$. The fact that $A_t$ decreases as $t$ increases implies
that $(N_t, t \ge 0)$ is some increasing process. Furthermore, $A_t \in \Pi(t)$ implies that

$$A_t = A^{N_t}_t, \qquad t \ge 0. \qquad (24)$$

Assuming for simplicity that $N_t$ tends to $\infty$ as $t \to \infty$, let $0 = \tau_0 < \tau_1 < \tau_2 < \cdots$ be the
successive times of jumps of $(N_t, t \ge 0)$, and set $N(n) = N_{\tau_n}$, $n \ge 0$. Then $A_t = A^{N(n)}_t$ for
$t \in [\tau_n, \tau_{n+1})$. Note that given the random sequence $1 = N(0) < N(1) < N(2) < \cdots$, the times
$\tau_n$ can be recovered without further reference to $A$, from the family of paths $A^n$ associated with
$\Pi$, as $\tau_n = \inf\{t \ge 0 : A^{N(n)}_t \neq A^{N(n-1)}_t\}$ for each $n \ge 1$. Thus there is a natural correspondence
between paths $A$ in $\Pi$ and increasing random sequences $(N(n), n \ge 1)$ subject to the constraint
that $N(n) = \min A^{N(n)}_{\tau_n}$, where in general, the possibility of a finite increasing sequence of
random length must also be allowed.
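The correspondence between a path and its sequence of least elements can be illustrated on a toy example. The following is a minimal sketch only, assuming a discrete time grid and partitions restricted to $\{1,2,3,4\}$; the helper names `is_path` and `jump_sequence` are hypothetical and do not come from the paper.

```python
# Toy illustration on a discrete time grid (an assumption made for this sketch):
# partitions[k] plays the role of Pi(t_k) restricted to {1,2,3,4},
# blocks[k] plays the role of A_{t_k}.

def is_path(partitions, blocks):
    """Check the two path conditions: A_t is a block of Pi(t), and A_t decreases in t."""
    in_partition = all(b in p for p, b in zip(partitions, blocks))
    decreasing = all(later <= earlier for earlier, later in zip(blocks, blocks[1:]))
    return in_partition and decreasing

def jump_sequence(blocks):
    """Return the grid indices at which N_t = min A_t jumps, and the values N(0), N(1), ..."""
    least = [min(b) for b in blocks]
    times, values = [0], [least[0]]
    for k in range(1, len(least)):
        if least[k] != least[k - 1]:
            times.append(k)
            values.append(least[k])
    return times, values

F = frozenset
partitions = [
    {F({1, 2, 3, 4})},
    {F({1, 3}), F({2, 4})},
    {F({1}), F({3}), F({2, 4})},
    {F({1}), F({3}), F({2}), F({4})},
]
blocks = [F({1, 2, 3, 4}), F({2, 4}), F({2, 4}), F({4})]

times, values = jump_sequence(blocks)
```

Here the path leaves the block of 1 at the first split and the block of 2 at the last split, so the recorded sequence is $N(0)=1$, $N(1)=2$, $N(2)=4$.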

In connection with the $\alpha$-self-similar CRT $\mathcal{T}$ derived from $\Pi$, notice that the random times
$\tau_n$ are defined in a way which allows corresponding random times

$$\tau_{n,\alpha} := \int_0^{\tau_n} |A_t|^\alpha\, dt$$

to be defined, and that in $\mathcal{T}$ there is a junction vertex $V_n$ at height $\tau_{n,\alpha}$ at which the paths to
leaves labelled $N(n-1)$ and $N(n)$ diverge. Here it is assumed that the CRT is equipped with a
random sample $\Sigma^*_1, \Sigma^*_2, \ldots$ of its leaves according to its mass measure, and that the homogeneous
fragmentation $\Pi = \Pi_0$ has been constructed by time changing the $\alpha$-self-similar fragmentation
associated with the tree and the leaf sample. The path from the root to $V_n$ in the CRT then
passes through $V_1, \ldots, V_{n-1}$. Since the CRT is compact, we find a convergent subsequence of
$(V_n, n \ge 1)$ with limit $\Sigma$, say. Because $(V_n, n \ge 1)$ is increasing for the genealogical partial order
$\prec$ that puts $\sigma \prec \sigma'$ iff $\sigma \in [[0, \sigma'[[$, the sequence converges to the same limit. Note that $\Sigma$ must
be a leaf almost surely, because if $\Sigma$ is not a leaf, then the fringe subtree $\mathcal{T}_\Sigma$ at $\Sigma$ will have
positive mass, but then $A_\infty = \bigcap_{t \ge 0} A_t$ will have positive limiting frequency, which contradicts
$\Pi(\infty) = \{\{1\}, \{2\}, \ldots\}$. The path from $0$ to $\Sigma$ in $\mathcal{T}$ starts by following the path to $\Sigma^*_1$, then
branches off in the direction of $\Sigma^*_{N(1)}$, then branches again in the direction of $\Sigma^*_{N(2)}$, etc. This
could be formalised to give a one-to-one correspondence between paths in $\Pi$ and paths in $\mathcal{T}$.

Proof of the if part of Theorem 14. Suppose that $M$ with splitting density $f$ has as its
symmetrisation the canonical fragmenter $M^1$ of $\Pi$, constructed from the blocks $(A^1_t, t \ge 0)$ of
$\Pi$ containing 1. Construct a bifurcator $(M^1, M)$ by switching from $M^1$ according to $p^*$ in (20),
as needed to create $M$ up to some branching time $\tau_1$. For $0 \le t < \tau_1$ let $A_t = A^1_t$, and let
$A_{\tau_1} = A^1_{\tau_1-} \setminus A^1_{\tau_1}$, that is the block that splits off from the block containing 1 at time $\tau_1$. To
continue the construction of $A$ after time $\tau_1$, apply the strong Markov property of $\Pi$ at $\tau_1$ and
let $N(1) = \min(A_{\tau_1})$. Set $A_t = A^{N(1)}_t$ for $\tau_1 \le t < \tau_2$ where $\tau_2 - \tau_1$ is the branching time
between $M^{N(1)} := (|A^{N(1)}_{\tau_1+s}|/|A^{N(1)}_{\tau_1}|, s \ge 0)$, which is another copy of $M^1$, and some further copy of
$M$ created conditionally given $M^1$ and $M^{N(1)}$ by the same rule. It is clear that continuing like
this creates time segments $\tau_j - \tau_{j-1}$ which are independent and identically distributed, and fresh
copies of $M^1$ as needed. The process $A$ with the desired feature that $|A| = M$ can be created
for all times $t \ge 0$. Moreover, by construction this process $A$ is a Markovian path in $\Pi$. $\square$

The only if part of the proof will be given in Section 3.

2.5 Mass distributions 

The bifurcator $(M, M^*)$ with $M^*$ derived from $M$ by size-biased branching plays a key role in
the following discussions. This section collects together some basic formulae for the joint distribution
of the branching time $\tau := \inf\{t \ge 0 : M_t \neq M^*_t\}$ and the decrements

$$M_{\tau-} - M_\tau = M^*_\tau \quad\text{and}\quad M_\tau = M^*_{\tau-} - M^*_\tau$$

where $M_{\tau-} = M^*_{\tau-}$. The triple of non-negative variables $(1 - M_{\tau-}, M^*_\tau, M_\tau)$ with sum 1 is of
special interest. In a suitably defined random $\mathbb{R}$-tree $\mathcal{R}_{\Sigma,\Sigma^*}$ spanned by a root and two leaves
$\Sigma$ and $\Sigma^*$, this triple represents the masses of three connected components of the tree formed
by removal of a particular random junction vertex of the tree. As indicated in the previous
section, this subtree $\mathcal{R}_{\Sigma,\Sigma^*}$ may be naturally embedded in a self-similar CRT $\mathcal{T}$ associated with
a fragmentation process, whose canonical fragmenter is $M^*$. The joint distribution of this triple
is determined by a formula for its joint moments provided by Gnedin and Pitman [16, page
477] where this triple is denoted $(G, H, D)$, with the following more elementary interpretation:
$(G, 1 - D)$ is the interval component covering $U$ in the complement of the range of $(1 - M_t, t \ge 0)$, for
$U$ a uniform$(0, 1)$ variable independent of $M$, and $H = 1 - D - G$ is the length of this interval
component. The following formulae can be read either from the discussion of the previous
sections, or from [16, page 477].

Recall first that for $M$ with Lévy exponent $\Phi$, Lévy measure $\Lambda(x)\,dx$ and splitting density $f$,
the branching time $\tau$ has exponential distribution with rate

$$\Phi(1) = \int_0^1 u(1-u) f(u)\,du = \int_0^\infty (1 - e^{-x}) \Lambda(x)\,dx.$$

The process $(M_t, 0 \le t < \tau)$ is then the negative exponential of a killed subordinator with Lévy
measure $e^{-x}\Lambda(x)\,dx$ and killing at rate $\Phi(1)$.
Thus

$$\mathbb{E}[M_t^p \mid \tau > t] = \mathbb{E}[M_{\tau-}^p \mid \tau = t] = e^{-t\Phi_0(p)}$$

where

$$\Phi_0(p) = \int_0^1 (1 - u^p)\, u^2 f(u)\,du = \Phi(p+1) - \Phi(1)$$

and hence by conditioning on $\tau$

$$\mathbb{E}[M_{\tau-}^p] = \frac{\Phi(1)}{\Phi(1) + \Phi_0(p)} = \frac{\Phi(1)}{\Phi(p+1)}. \qquad (25)$$

This is [16, formula (57) or (59)]. From [16, formula (28)], or from (15),

$$\mathbb{P}\left(\frac{M_\tau}{M_{\tau-}} \in du\right) = (\Phi(1))^{-1}(1-u)\, u f(u)\,du, \qquad (26)$$

hence

$$\mathbb{E}\left[\left(\frac{M_\tau}{M_{\tau-}}\right)^p\right] = \frac{\Phi(p+1) - \Phi(p)}{\Phi(1)}. \qquad (27)$$

Moreover, $M_{\tau-}$ and $M_\tau/M_{\tau-}$ are independent, so the last two formulae combine to give

$$\mathbb{E}(M_\tau^p) = \frac{\Phi(p+1) - \Phi(p)}{\Phi(p+1)}, \qquad (28)$$

which is a simplification of [16, formula (60)]. Next, $M^*_\tau := M_{\tau-} - M_\tau$ is a size-biased pick from
the decrements of $M$, whence

$$\mathbb{E}[(M^*_\tau)^p] = \frac{\Phi(p+1, p+1)}{\Phi(p+1)}, \quad\text{where}\quad \Phi(p+1, p+1) = \int_0^1 (1-u)^{p+1}\, u f(u)\,du. \qquad (29)$$

Note that for positive integers $p = n$ say, this is a linear combination of evaluations of $\Phi(k)$
at integers $k \le n + 1$, as indicated in [16, formula (25)]. In principle, these Mellin transforms
determine the distributions of $M_\tau$ and $M^*_\tau$, but there do not seem to be simple formulas for the
densities of these variables except in special cases.

Observe that the expected masses of the three components in the junction split are

$$\mathbb{E}(M^*_\tau) = (2\Phi(1) - \Phi(2))/\Phi(2) \qquad (30)$$

$$\mathbb{E}(1 - M_{\tau-}) = \mathbb{E}(M_\tau) = (\Phi(2) - \Phi(1))/\Phi(2) \qquad (31)$$

It does not seem obvious intuitively why the expectations of $(1 - M_{\tau-})$ and $M_\tau$ are always equal.
Recall from (19) that $\Phi^*(p) := \Phi(p+1) - \Phi(p+1, p+1)$ is the Laplace exponent corresponding
to $M^*$ with splitting density $f^*(u) = u f(u) + (1-u) f(1-u)$. So we obtain the following extension
of Corollary 10.
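As a numerical sanity check of (30), (31) and of the relation between $\Phi^*$ and the symmetrised density $f^*$, the following minimal sketch evaluates the integrals by a midpoint rule. The test density $f(u) = 2u$, the quadrature and all function names are assumptions chosen for illustration; they are not taken from the paper.

```python
# Sketch under stated assumptions: f(u) = 2u is an arbitrary non-symmetric test density,
# and integrals over (0, 1) are approximated by a plain midpoint rule.

def integrate(g, n=20_000):
    """Midpoint rule for the integral of g over (0, 1)."""
    h = 1.0 / n
    return h * sum(g((k + 0.5) * h) for k in range(n))

def phi(p, f):
    """Phi(p) = int_0^1 (1 - u^p) u f(u) du, the Laplace exponent at p."""
    return integrate(lambda u: (1.0 - u ** p) * u * f(u))

def phi_diag(q, f):
    """Phi(q, q) = int_0^1 (1 - u)^q u f(u) du, as in (29)."""
    return integrate(lambda u: (1.0 - u) ** q * u * f(u))

def symmetrise(f):
    """f*(u) = u f(u) + (1 - u) f(1 - u)."""
    return lambda u: u * f(u) + (1.0 - u) * f(1.0 - u)

def phi_star(p, f):
    """Phi*(p) = Phi(p + 1) - Phi(p + 1, p + 1)."""
    return phi(p + 1.0, f) - phi_diag(p + 1.0, f)

def expected_masses(f):
    """Return (E M*_tau, E(1 - M_{tau-}), E M_tau) from (30) and (31)."""
    p1, p2 = phi(1.0, f), phi(2.0, f)
    return (2.0 * p1 - p2) / p2, (p2 - p1) / p2, (p2 - p1) / p2

f = lambda u: 2.0 * u
e_star, e_root, e_tagged = expected_masses(f)
```

For this test density, $\Phi(1) = 1/6$ and $\Phi(2) = 4/15$ in closed form, so the three expected masses are $1/4$, $3/8$ and $3/8$, which sum to 1 as they must.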

Corollary 16 Two further conditions, equivalent to each other and to the conditions of Corollary 10
including symmetry of $f$, to $f = f^*$, and to $\Phi = \Phi^*$, are

(vii) $M_\tau / M_{\tau-} \overset{d}{=} M^*_\tau / M_{\tau-}$

(viii) and $M_\tau \overset{d}{=} M^*_\tau$.

2.6 Edge lengths and exponential functionals 

Continuing to suppose that $(M, M^*)$ is the bifurcator derived from $M$ by size-biased branching,
as well as the basic triple of masses $(1 - M_{\tau-}, M^*_\tau, M_\tau)$ with sum 1, for each $\rho > 0$ we may
consider the triple of exponential functionals

$$L_0 = \int_0^\tau M_t^\rho\,dt, \qquad L_\Sigma = \int_\tau^\infty M_t^\rho\,dt \qquad\text{and}\qquad L_* = \int_\tau^\infty (M^*_t)^\rho\,dt,$$

which can be interpreted as the lengths of branches in a suitably defined random $\rho$-self-similar
$\mathbb{R}$-tree with three branches meeting at a junction point, these branches being labelled by $0$ for
the root, and $\Sigma$ and $*$ for the two leaves associated with $M$ and $M^*$ respectively. Note that
the definition of the $L_j$ depends on the parameter $\rho$, which is suppressed in the notation. In
particular, if the Lévy measure satisfies the regular variation condition $\int_x^\infty \Lambda(y)\,dy = x^{-\alpha}\ell(1/x)$
as $x \downarrow 0$, for some $\alpha \in (0, 1)$, these functionals for $\rho = \alpha$ are of special interest [17], as they
govern the asymptotics of how numbers of new branch points grow along the three branches as
new branch points are selected by size-biased sampling from the mass distribution on the $\mathbb{R}$-tree
$\mathcal{R}_{\Sigma,\Sigma^*}$, which assigns the decrements of $M$ except $M_{\tau-} - M_\tau = M^*_\tau$ to the branch from the root
to leaf $\Sigma$, and the remaining decrements of $M^*$ to the branch from the junction point to leaf
$\Sigma^*$. Let

$$L_{0\Sigma} := L_0 + L_\Sigma = \int_0^\infty M_t^\rho\,dt = \int_0^\infty e^{-\rho\xi_t}\,dt$$

$$L_{0*} := L_0 + L_* = \int_0^\infty (M^*_t)^\rho\,dt = \int_0^\infty e^{-\rho\xi^*_t}\,dt$$

where $\xi$ and $\xi^*$ are the two subordinators associated with $M$ and $M^*$. According to a known
formula for subordinators [9]

$$\mathbb{E}(L_{0\Sigma}^n) = \frac{n!}{\Phi(\rho)\cdots\Phi(n\rho)} \qquad (32)$$

where $\Phi$ is the Laplace exponent of $\xi$, and the same holds for $L_{0*}$ instead of $L_{0\Sigma}$ with the Laplace
exponent $\Phi^*$ of $\xi^*$ instead of $\Phi$. Now

$$L_0 = \int_0^\tau e^{-\rho\xi_t}\,dt = \int_0^\infty e^{-\rho\eta_t}\,dt$$

where $\eta_t = \xi_t 1_{\{\tau > t\}} + \infty 1_{\{\tau \le t\}}$ is another subordinator, whose Lévy measure is $e^{-x}\Lambda(x)\,dx +
\Phi(1)\delta_\infty(dx)$ for $\Lambda(x)\,dx$ the Lévy measure of $\xi$. It follows that the Laplace exponent of $\eta$ at $\rho$ is
just $\Phi(\rho + 1)$, and hence that

$$\mathbb{E}(L_0^n) = \frac{n!}{\Phi(\rho+1)\cdots\Phi(n\rho+1)} \qquad\text{for all } n \in \mathbb{N}. \qquad (33)$$

Moments of $L_\Sigma$ and $L_*$ can now be found using the distributional identities $L_\Sigma = M_\tau^\rho \widetilde{L}_{0\Sigma}$ and
$L_* = (M^*_\tau)^\rho \widetilde{L}_{0*}$ where $\widetilde{L}_{0\Sigma}$ is independent of $M_\tau$ with $\widetilde{L}_{0\Sigma} \overset{d}{=} L_{0\Sigma}$, and $\widetilde{L}_{0*}$ is independent of
$M^*_\tau$ with $\widetilde{L}_{0*} \overset{d}{=} L_{0*}$. Thus

$$\mathbb{E}(L_\Sigma^n) = \mathbb{E}(M_\tau^{\rho n})\,\mathbb{E}(L_{0\Sigma}^n) = \frac{\Phi(\rho n+1) - \Phi(\rho n)}{\Phi(\rho n+1)} \cdot \frac{n!}{\Phi(\rho)\cdots\Phi(n\rho)}, \qquad (34)$$

$$\mathbb{E}(L_*^n) = \mathbb{E}((M^*_\tau)^{\rho n})\,\mathbb{E}(L_{0*}^n) = \frac{\Phi(\rho n+1, \rho n+1)}{\Phi(\rho n+1)} \cdot \frac{n!}{\Phi^*(\rho)\cdots\Phi^*(n\rho)}. \qquad (35)$$

Note the two identities in distribution

$$L_{0\Sigma} = L_0 + M_\tau^\rho \widetilde{L}_{0\Sigma} \qquad\text{and}\qquad L_{0*} = L_0 + (M^*_\tau)^\rho \widetilde{L}_{0*} \qquad (36)$$

where $\widetilde{L}_{0\Sigma} \overset{d}{=} L_{0\Sigma}$ with $\widetilde{L}_{0\Sigma}$ independent of $(L_0, M_\tau)$, and $\widetilde{L}_{0*} \overset{d}{=} L_{0*}$ with $\widetilde{L}_{0*}$ independent of
$(L_0, M^*_\tau)$. As checks, the two equalities of means implied by (36) are easily seen to be consistent
with previous formulae. The equalities of higher moments in (36) provide identities involving joint
moments such as $\mathbb{E}(L_0^j M_\tau^{k\rho})$ for positive integers $j$ and $k$. In particular, $\mathbb{E}(L_0 M_\tau^\rho)$ is determined
by the second moment formula. But the third moment formula only gives access to a linear
combination of $\mathbb{E}(L_0 M_\tau^{2\rho})$ and $\mathbb{E}(L_0^2 M_\tau^\rho)$, which is not so useful.
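The consistency of means mentioned above can be checked directly from the moment formulae. The sketch below is illustrative only: it assumes the test density $f(u) = 2u$, for which $\Phi(p) = 2p/(3(p+3))$ and $\Phi(q,q) = 4/((q+1)(q+2)(q+3))$ in closed form, and the function names are hypothetical.

```python
from math import factorial

# Closed forms for the assumed test density f(u) = 2u (not a choice made in the paper).
def phi(p):
    """Phi(p) = int_0^1 (1 - u^p) 2u^2 du."""
    return 2.0 * p / (3.0 * (p + 3.0))

def phi_diag(q):
    """Phi(q, q) = int_0^1 (1 - u)^q 2u^2 du."""
    return 4.0 / ((q + 1.0) * (q + 2.0) * (q + 3.0))

def phi_star(p):
    """Laplace exponent of the symmetrised fragmenter."""
    return phi(p + 1.0) - phi_diag(p + 1.0)

def functional_moment(n, rho, exponent, shift=0.0):
    """n! / prod_{k=1}^n exponent(k*rho + shift); shift=0 gives the form of (32), shift=1 of (33)."""
    prod = 1.0
    for k in range(1, n + 1):
        prod *= exponent(k * rho + shift)
    return factorial(n) / prod

def moment_root(n, rho):
    """E(L_0^n) as in (33)."""
    return functional_moment(n, rho, phi, shift=1.0)

def moment_sigma(n, rho):
    """E(L_Sigma^n): moment of M_tau^(rho n) from (28) times E(L_{0 Sigma}^n)."""
    q = n * rho
    return (phi(q + 1.0) - phi(q)) / phi(q + 1.0) * functional_moment(n, rho, phi)

def moment_star(n, rho):
    """E(L_*^n): moment of (M*_tau)^(rho n) from (29) times E(L_{0*}^n)."""
    q = n * rho
    return phi_diag(q + 1.0) / phi(q + 1.0) * functional_moment(n, rho, phi_star)

rho = 0.5
mean_gap_sigma = moment_root(1, rho) + moment_sigma(1, rho) - functional_moment(1, rho, phi)
mean_gap_star = moment_root(1, rho) + moment_star(1, rho) - functional_moment(1, rho, phi_star)
```

Both gaps vanish up to rounding, reflecting $\mathbb{E}(L_0) + \mathbb{E}(L_\Sigma) = \mathbb{E}(L_{0\Sigma})$ and the analogous identity for $L_{0*}$.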

3 Bead splitting processes and continuum random trees 

Recall from Section 2.3 that every self-similar CRT $(\mathcal{T}, \mu)$ gives rise to a growing family $(\mathcal{R}^*_k, \mu^*_k)$
of weighted $\mathbb{R}$-trees that converges to $(\mathcal{T}, \mu)$. As we will demonstrate more formally below,
picking $\Sigma^*_{k+1}$ from $\mu$ means that a junction point $J^*_k \in \mathcal{R}^*_k$ is picked from $\mu^*_k$ and that $\Sigma^*_{k+1}$ is
then picked within a subtree rooted at $J^*_k$, which is a rescaled copy of $(\mathcal{T}, \mu)$, by self-similarity.
Then

$$\mathcal{R}^*_{k+1} = \mathcal{R}^*_k \cup\, ]]J^*_k, \Sigma^*_{k+1}]] \qquad\text{and}\qquad \mu^*_{k+1} = \mu^*_k - \mu^*_k(\{J^*_k\})\delta_{J^*_k} + \mu^*_{k,k+1}$$

where $(]]J^*_k, \Sigma^*_{k+1}]], \mu^*_{k,k+1})$ is a rescaled copy of $(\mathcal{R}^*_1, \mu^*_1)$. Since $J^*_k$ is picked from $\mu^*_k$, we say
that $((\mathcal{R}^*_k, \mu^*_k), k \ge 1)$ develops by size-biased branching, generalising the case $k = 1$ that relates
to Proposition 13 via the self-similar time change (22).

3.1 Size-biased bead selection and strongly sampling consistent compositions

The basic building block for the tree growth process $((\mathcal{R}^*_k, \mu^*_k), k \ge 1)$ is a family of independent
copies of $(\mathcal{R}^*_1, \mu^*_1)$, or equivalently, a family of independent copies of a fragmenter $M^*$, related
by the following general construction.

Definition 17 (String of beads) Given a decreasing pure jump process $M$ and two positive
real parameters $\alpha$ and $m$, we construct a string of beads of mass $m$ by placing a random discrete
measure $\mu_{M,\alpha,m}$ on the interval $(0, Y_{M,\alpha,m}]$ of random length

$$Y_{M,\alpha,m} = m^\alpha \int_0^\infty M_s^\alpha\,ds$$

according to the formula

$$\mu_{M,\alpha,m}\left(\left(m^\alpha \int_0^t M_s^\alpha\,ds,\; m^\alpha \int_0^\infty M_s^\alpha\,ds\right]\right) = m M_t, \qquad t \ge 0.$$

If $M$ is a fragmenter with Laplace exponent $\Phi$, we call $([0, Y_{M,\alpha,m}], \mu_{M,\alpha,m})$ an $(\alpha, \Phi)$-string of
beads of mass $m$.

Note that for each $t$ that is a jump time of $M$, the measure $\mu_{M,\alpha,m}$ puts mass $m(M_{t-} - M_t)$
at the location $m^\alpha \int_0^t M_s^\alpha\,ds$. Now, by repeated application of this scheme, we construct an
increasing sequence of $\mathbb{R}$-trees $(R_n, n \ge 1)$, where each $R_n$ is equipped with a random discrete
distribution $\mu_n$.
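To make the placement of beads concrete, here is a minimal sketch for a toy path with finitely many jumps. It assumes that $M$ starts at 1, is piecewise constant, and reaches 0 at its last jump, so that the length is finite; a fragmenter driven by a subordinator with infinite Lévy measure has infinitely many jumps and is not covered. The function name `string_of_beads` is hypothetical.

```python
def string_of_beads(jumps, alpha, m=1.0):
    """Place beads for a piecewise-constant decreasing path M with M_0 = 1.

    jumps: list of (jump time t, value M_t after the jump), with increasing times,
           decreasing values, and final value 0 (toy assumption).
    Returns (beads, length) where beads is a list of (location, mass) and
    location = m^alpha * int_0^t M_s^alpha ds, mass = m * (M_{t-} - M_t).
    """
    beads = []
    location = 0.0
    prev_t, level = 0.0, 1.0
    for t, new_level in jumps:
        if t <= prev_t or not (0.0 <= new_level < level):
            raise ValueError("jumps must have increasing times and decreasing values")
        # M is constant equal to `level` on [prev_t, t), so the integral is explicit.
        location += (m ** alpha) * (level ** alpha) * (t - prev_t)
        beads.append((location, m * (level - new_level)))
        prev_t, level = t, new_level
    if level != 0.0:
        raise ValueError("toy path must end at 0 so that the string has finite length")
    return beads, location

# Example: M drops from 1 to 1/2 at time 1, then to 0 at time 3.
beads, length = string_of_beads([(1.0, 0.5), (3.0, 0.0)], alpha=1.0, m=1.0)
```

In this example the two beads have mass $1/2$ each and sit at locations 1 and 2, and the string has length 2.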

Definition 18 (Bead splitting process) Let $\alpha > 0$ and $M_n$, $n \ge 1$, a sequence of decreasing
pure jump processes starting from 1.

• Let $(R_1, \mu_1)$ be the string of beads of mass 1 associated with $M_1$ and $\alpha$. More specifically,
let $R_1 = [0, \Sigma_1] := [0, Y_{M_1,\alpha,1}]$ equipped with the usual distance, with root vertex $0$ and
with the random discrete distribution $\mu_1 = \mu_{M_1,\alpha,1}$.

• Given that $R_n$ has been defined as an $\mathbb{R}$-tree with root vertex $0$ and $n$ leaves $\Sigma_1, \ldots, \Sigma_n$,
and equipped with a mass measure $\mu_n$ with total mass 1, let $R_{n+1}$ be defined as follows.
Pick a junction point $J_n$ from $R_n$ according to $\mu_n$. Given that $\mu_n(\{J_n\}) = m$, remove the
mass $m$ from point $J_n$, distribute this mass according to a copy $(]]J_n, \Sigma_{n+1}]], \mu_{n,n+1})$ of the
string of beads $([0, Y_{M_{n+1},\alpha,m}], \mu_{M_{n+1},\alpha,m})$, and then attach this segment to $(R_n, \mu_n - m\delta_{J_n})$
at $J_n$ to form $(R_{n+1}, \mu_{n+1})$.

We refer to the projective sequence $((R_n, \mu_n), n \ge 1)$ of weighted $\mathbb{R}$-trees as a bead splitting
process that develops by size-biased branching.

We think of $(R_n, \mu_n)$ as $n$ pieces of string $[0, Y_{M_1,\alpha,1}], [0, Y_{M_2,\alpha,\mu_1(\{J_1\})}], \ldots, [0, Y_{M_n,\alpha,\mu_{n-1}(\{J_{n-1}\})}]$
tied at the junction points $J_1, \ldots, J_{n-1}$, with beads according to $\mu_n$. The $n$th growth step selects
bead $J_n$ of size $\mu_n(\{J_n\})$ and splits it into smaller beads that are placed onto a new piece of
string tied to $J_n$.
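The mass bookkeeping of a growth step can be sketched as follows. This is a simplified illustration that tracks bead masses only and ignores the tree geometry; the stick-breaking splitter is a hypothetical stand-in for the bead sizes of a string of beads, not the law of any particular fragmenter.

```python
import random

def stick_breaking(m, rng, pieces=5):
    """Hypothetical splitter: break mass m into `pieces` beads by uniform stick breaking."""
    out, rest = [], m
    for _ in range(pieces - 1):
        u = rng.random()
        out.append(rest * (1.0 - u))  # bead broken off the remaining stick
        rest *= u                     # mass still to be distributed
    out.append(rest)                  # last bead takes what is left
    return out

def split_step(beads, splitter, rng):
    """One growth step: pick a bead with probability proportional to its mass, then split it."""
    i = rng.choices(range(len(beads)), weights=beads, k=1)[0]
    new_beads = splitter(beads[i], rng)
    return beads[:i] + beads[i + 1:] + new_beads

rng = random.Random(0)
beads = stick_breaking(1.0, rng)      # masses of the first string of beads
for _ in range(10):
    beads = split_step(beads, stick_breaking, rng)
```

Each step removes one bead and adds the beads of a new string, so the total mass stays equal to 1 throughout.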

The growth process $((R_n, \mu_n), n \ge 1)$ gives rise to an ordered spinal partition of $\mathbb{N} \setminus \{1\}$ in
the terminology of [21], which we can represent by a point process

$$\Pi^{\prec 1}_s = \{n \in \mathbb{N} \setminus \{1\} : J_{1,n} = g_{0,\Sigma_1}(s)\}, \qquad s \ge 0, \qquad (37)$$

where $J_{1,n}$ is the branch point that has $\Sigma_1$ and $\Sigma_n$ in two different subtrees, where $[[0, J_{1,n}]] =
[[0, \Sigma_1]] \cap [[0, \Sigma_n]]$, and $g_{0,\Sigma_1} : [0, d(0, \Sigma_1)] \to [[0, \Sigma_1]]$ is the unique isometry with $g_{0,\Sigma_1}(0) = 0$.

Proposition 19 Given any bead splitting process $((R_n, \mu_n), n \ge 1)$ that develops by size-biased
branching, the ordered spinal partition $\Pi^{\prec 1}$ defined in (37) is exchangeable. In particular, if
we choose $M_1$ to be a fragmenter, then the spinal partition gives rise to a strongly sampling
consistent regenerative composition structure $(C_n, n \ge 1)$, which records for each $n \ge 1$ the
vector $C_n$ of non-zero block sizes $\#(\Pi^{\prec 1}_s \cap [n+1])$, $s \ge 0$, of $\Pi^{\prec 1} \cap [n+1]$, arranged in the spinal
order of blocks given by the order of positions $s \ge 0$ on the spine.

Proof. The first statement holds since $\mu_1$ is the projection of $\mu_n$ to $R_1$ for all $n \ge 1$, so the
picks of $J_n$ projected to $R_1$ are exchangeable picks from $\mu_1$ by the use of size-biased branching.
The second statement now follows directly from Gnedin and Pitman [16, Theorem 5.2]. $\square$



Remark 20 Gnedin, Pitman and Yor [17] established limit theorems for the number of blocks
$\#C_n$ of strongly sampling consistent regenerative composition structures, $\#C_n/n^\alpha \to Y_{M,\alpha,1}$ a.s.,
under a regular variation condition. We exploited this in [20, Proposition 7] for exchangeably
labelled trees to see that reduced trees $\tau_{n,k}$, obtained from the discrete tree shape $\tau_n$ of $R_n$
with unit edge lengths, converge to $R_k$ when rescaling all edge lengths by $n^\alpha$. This result can
be generalised to the present setup, when $(M_n, n \ge 1)$ is a family of independent fragmenters,
which includes non-exchangeable cases. See also [28, Proposition 14] for the alpha-theta model,
which adds projected uniform measures that converge to the limiting strings of beads. The bead
splitting process we identified for the alpha-theta model develops by size-biased branching only
for $\theta = \alpha$. For the other cases, we found different bead selection rules in connection with ordered
Chinese restaurant processes.

3.2 Convergence of bead splitting processes to self-similar CRTs 

The next theorem establishes CRT convergence of bead splitting processes $((R_n, \mu_n), n \ge 1)$
in the sense of Definition 18 not just for the case of symmetric splitting rules $f^*$ that relate
directly to the growth process $((\mathcal{R}^*_n, \mu^*_n), n \ge 1)$ obtained by sampling from the measure $\mu$ of
a CRT $(\mathcal{T}, \mu)$, but also for fragmenters $(M_n, n \ge 1)$ with non-symmetric splitting rules $f$, with
convergence to a CRT associated with the symmetrised splitting rule $f^*$ associated with $f$.

Theorem 21 If the $M_n$ are independent fragmenters with splitting density $f$, then for each
$\alpha > 0$ the sequence of weighted random $\mathbb{R}$-trees $(R_n, \mu_n)$ converges almost surely in the Gromov-
Hausdorff-Prohorov metric to a limit tree $(\mathcal{T}, \mu)$, which is a copy of the $\alpha$-self-similar tree that
is canonically associated with a binary fragmentation process with splitting density $f^*(u) =
u f(u) + (1 - u) f(1 - u)$. In addition, we also have $(R_n, \nu_n) \to (\mathcal{T}, \mu)$ almost surely in the
Gromov-Hausdorff-Prohorov metric, where $\nu_n$ is the uniform measure on the $n$ leaves of $R_n$.

If f{u) = -r=u~^''^{l — u)~^''^ it follows easily from the work of Haas and Miermont [19] that 
the sequence $(R_n, n \ge 1)$ has the same distribution as the increasing sequence of trees provided
by Aldous's [5] line-breaking construction of the Brownian CRT. Therefore, Theorem 21 can be
seen as a generalisation of Aldous's line-breaking construction. We discuss this example of a
bead splitting process {{Rn,iJin),n > 1) in Section [3^ 

To prove this theorem, we will embed $(R_n, \mu_n)$ in a CRT $(\mathcal{T}, \mu)$, as has (essentially) been
done for $(R_1, \mu_1)$ in Theorem 14. A key tool will be the following spinal decomposition result.

Lemma 22 (Spinal decomposition) Let $A = (A_t, t \ge 0)$ be a Markovian path in a homogeneous
fragmentation process $\Pi = (\Pi(t), t \ge 0)$. For each $n \ge 1$, denote by $\Pi^{(n)}(t)$ the block of
$\Pi(t)$ containing $n$, $t \ge 0$, consider $\sigma_n = \inf\{t \ge 0 : n \notin A_t\}$ and the associated spinal partition
$\Pi^A(0) = \{\Pi^{(n)}(\sigma_n), n \ge 1\}$. Then conditionally given $\Pi^A(0)$ and $(\sigma_n, n \ge 1)$, the process

$$\Pi^A(t) = \{\Pi^{(n)}(\sigma_n + t), n \ge 1\} = \bigcup_{i \ge 1} \Pi(\sigma_{\min \Pi^A_i(0)} + t) \cap \Pi^A_i(0), \qquad t \ge 0,$$

is a fragmentation process starting from $\Pi^A(0)$, with the same transition kernel as $\Pi$.

Proof. We extend the proof of [21, Proposition 4] to the higher generality here of decomposing
along a Markovian path. The family of times $(\sigma_n, n \ge 1)$ is a stopping line for the filtration
$\mathcal{F} = (\mathcal{F}_t, t \ge 0)$, with respect to which $A$ is a Markovian path in $\Pi$. Here, we use the terminology
of Bertoin [7, Definition 3.4] and seek to obtain from [7, Lemma 3.14] that the extended branching
property holds, which yields precisely the result we need. Since Bertoin uses natural filtrations,
and to demonstrate where the Markovian assumption on the path enters the argument, let us
briefly retrace Bertoin's steps and sketch relevant parts of the proof of the extended branching
property. Without loss of generality, $\mathcal{F}$ is the filtration generated by $(\Pi, A)$. Also denote by
$\mathcal{F}^{(n)}$ the filtrations generated by $(\Pi^{(n)}, A_{\cdot \wedge \sigma_n})$, for each $n \ge 1$. We consider approximations
$\sigma_n^{(h)} = \min\{t \in h\mathbb{N} : n \notin A_t\}$, $\sigma_n^{(h,k)} = \min(kh, \sigma_n^{(h)})$ and $\widetilde{\sigma}_n^{(h,k)} = \inf\{t \in \{h, 2h, \ldots, kh\} : n \notin A_t\}$
with $\inf \emptyset := \infty$, of $\sigma_n$. The branching property at the stopping line $(\sigma_n^{(h,1)}, n \ge 1)$ is just the
branching property at $t = h$. At $h$, or by induction hypothesis at $(\sigma_n^{(h,k)}, n \ge 1)$, the assumption
on the path to be Markovian ensures that $(\Pi(kh + t) \cap A_{kh}, A_{kh+t}, t \ge 0)$ is conditionally
independent of $(\{\Pi^{(n)}(\sigma_n^{(h,k)} + t), n \notin A_{kh}\}, t \ge 0)$ given $\mathcal{F}_{\sigma^{(h,k)}}$, defined as the sigma-algebra
generated by $\mathcal{F}^{(n)}_{\sigma_n^{(h,k)}}$, $n \ge 1$.

To $(\Pi(kh + t) \cap A_{kh}, A_{kh+t}, t \ge 0)$, we can apply the branching property at $t = h$ and trivially at
$t = \infty$ to complete the induction step from $k$ to $k + 1$. This establishes the extended branching
property at $(\sigma_n^{(h,k)}, n \ge 1)$ and $(\widetilde{\sigma}_n^{(h,k)}, n \ge 1)$ for all $h > 0$ and $k \ge 1$. We omit the remainder
of the proof, which uses the standard approximation $\widetilde{\sigma}_n^{(2^{-k}, k2^k)} \downarrow \sigma_n$ as $k \to \infty$. $\square$

The next lemma and its proof demonstrate that we can iterate the embedding of a Markovian 
path in a homogeneous fragmentation process to embed a bead splitting process in an associated 
self-similar CRT to which the bead splitting process converges almost surely. 

Lemma 23 Let $A = (A_t, t \ge 0)$ be a Markovian path in a homogeneous fragmentation process
$\Pi = (\Pi(t), t \ge 0)$ with symmetric dislocation measure $\nu(ds) = f^*(s)\,ds$ and $M_t = |A_t|$, $t \ge 0$,
its residual mass process. If the $M_n$, $n \ge 1$, are independent copies of $M$, then for each
$\alpha > 0$ the sequence of weighted random $\mathbb{R}$-trees $(R_n, \mu_n)$ converges almost surely in the Gromov-
Hausdorff-Prohorov metric to a limit tree $(\mathcal{T}, \mu)$, which is a copy of the $\alpha$-self-similar CRT that
is canonically associated with $\Pi$. In addition, we also have $(R_n, \nu_n) \to (\mathcal{T}, \mu)$ almost surely in
the Gromov-Hausdorff-Prohorov metric, where $\nu_n$ is the uniform measure on the $n$ leaves of $R_n$.

Proof. We can consider the spinal partition $\Pi^A(0)$ of $\Pi$ and use Lemma 22 to construct in a
measurable way (see e.g. [21, Corollary 3]) a string of beads $(R_1, \mu_1) = ([0, Y_{|A|,\alpha,1}], \mu_{|A|,\alpha,1})$ with
a collection $(s_i, \mathcal{T}^{(i)}, \mu^{(i)})$ of spinal subtrees constructed from $(\Pi(\sigma_{\min \Pi^A_i(0)} + t) \cap \Pi^A_i(0), t \ge 0)$,
where $s_i = \int_0^{\sigma_{\min \Pi^A_i(0)}} |A_r|^\alpha\,dr$, $i \ge 1$, such that the tree $(\mathcal{T}, \mu)$ obtained by grafting $(\mathcal{T}^{(i)}, \mu^{(i)})$
to $(R_1, 0)$ at $s_i$ for all $i \ge 1$, is a self-similar CRT with splitting density $f^*$. This construction
gives rise to a family of regular conditional distributions of $(R_1, \mu_1; (s_i, \mathcal{T}^{(i)}, \mu^{(i)}), i \ge 1)$ given
$(\mathcal{T}, \mu)$, and we can use these via the Ionescu-Tulcea theorem to obtain a probability space that
allows the following construction.

Suppose we have constructed $(R_n, \mu_n; (x_i, \mathcal{T}^{(i)}, \mu^{(i)}), i \in I_n)$ with $R_n \subset \mathcal{T}$, $\mu_n$ the projection
of $\mu$ onto $R_n$ and, conditionally given $(R_n, \mu_n)$, a collection $((\mathcal{T}^{(i)}, \mu^{(i)}), i \in I_n)$ of independent
copies of $(\mathcal{T}, \mu)$ subjected to $\alpha$-self-similar scaling by $\mu_n(\{x_i\})$, which when grafted at $x_i \in R_n$
for all $i \in I_n$ give $(\mathcal{T}, \mu)$. Now pick a junction point $J_n = x_{i_n}$ from $R_n$ according to $\mu_n$. Given
that $\mu_n(\{J_n\}) = m$, remove $J_n$ from $\mu_n$ and remove $i_n$ from $I_n$. Use the regular conditional
distribution given the rescaled chosen subtree $(\mathcal{T}^{(i_n)}, \mu^{(i_n)})$ to obtain a string of beads with
grafted spinal subtrees distributed as $(R_1, \mu_1; (s_i, \mathcal{T}^{(i)}, \mu^{(i)}), i \ge 1)$, without modifying the chosen
rescaled subtree. After $\alpha$-self-similar scaling by $m$, graft the string of beads at $J_n$, add the new
spinal subtrees to the collection to form $(R_{n+1}, \mu_{n+1}; (x_i, \mathcal{T}^{(i)}, \mu^{(i)}), i \in I_{n+1})$. Then $R_{n+1} \subset \mathcal{T}$,
$\mu_{n+1}$ is the projection of $\mu$ onto $R_{n+1}$ and, conditionally given $(R_{n+1}, \mu_{n+1})$, the collection
$((\mathcal{T}^{(i)}, \mu^{(i)}), i \in I_{n+1})$ consists of scaled independent copies of $(\mathcal{T}, \mu)$ that turn $R_{n+1}$ into $(\mathcal{T}, \mu)$
when grafted at $x_i$, $i \in I_{n+1}$.

By induction, this gives a sequence $((R_n, \mu_n), n \ge 1)$ embedded in $(\mathcal{T}, \mu)$, which develops by
size-biased branching and is based on independent copies of the string of beads associated with
$M = |A|$. While constructed within a CRT, this sequence has the same (joint) distribution as
the sequence described in the statement of the lemma. It therefore suffices to prove almost sure
convergence for this embedded sequence.

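First, consider the measures $\mu_n$, $n \ge 1$. Denote by $|\mu_n|^\downarrow \in \mathcal{S}^\downarrow$ the decreasing rearrangement
of bead sizes $\mu_n(\{x\})$, $x \in R_n$. Since $A$ is embedded in $(\mathcal{T}, \mu)$, the measure $\mu_1$ cannot have an
atom of size 1, in particular there is $\lambda < 1$ such that $\mathbb{P}(|\mu_1|^\downarrow_1 < \lambda) > 0$. Now let $\varepsilon = 1/K > 0$.
By selecting the $K$ largest beads in turn, we see that $\mathbb{P}(|\mu_{n+K}|^\downarrow_1 < \lambda s_1 \vee \varepsilon \mid |\mu_n|^\downarrow = s) > 0$ for all
$s \in \mathcal{S}^\downarrow$. For $m$ with $\lambda^m < \varepsilon$ this implies $p = \mathbb{P}(|\mu_m|^\downarrow_1 < \varepsilon) > 0$, but then $|\mu_n|^\downarrow_1$ will be less than $\varepsilon$
after a time that is bounded above by $m$ times a geometric random variable with parameter $p$.
In particular,

$$\mathbb{P}(\mu_n \text{ has an atom of size greater than } \varepsilon \text{ for all } n \ge 1) = 0. \qquad (38)$$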

Now denote by $R_\infty$ the completion of the increasing union $\bigcup_{n \ge 1} R_n$, and assume that
$\mathbb{P}(R_\infty \neq \mathcal{T}) > 0$. For $x \in \mathcal{T} \setminus R_\infty$, we find $y \in R_\infty$ such that $]]y, x]] \cap R_\infty = \emptyset$, but then
$\mu(\mathcal{T}_y) > 0$, since $\mu$ assigns positive weight to all fringe subtrees. Since $\mu_n$ is the projection of
$\mu$ onto $R_n \subset R_\infty$, this contradicts (38). Hence $\mathbb{P}(R_\infty = \mathcal{T}) = 1$. Similarly, assuming that $R_n$
does not converge to $\mathcal{T}$ for the Hausdorff distance on $\mathcal{T}$, we can use compactness to find $x \in \mathcal{T}$
with $d(x, R_n) > \varepsilon$ for all $n \ge 1$, so $x \notin R_\infty$ is a contradiction. Also, Hausdorff convergence
of $R_n \subset \mathcal{T}$ to $\mathcal{T}$ with projected measures implies Gromov-Hausdorff-Prohorov convergence
$d_{\rm GHP}(R_n, \mathcal{T}) \to 0$ almost surely as $n \to \infty$ (see e.g. [28, Lemma 17]).

Finally, the measure $\nu_n$ on the $n$ leaves of $R_n$ is more and more closely coupled with the
measure $\nu^*_n$ on $\mathcal{T}$ associated with a sample $\Sigma^*_1, \ldots, \Sigma^*_n$ from $\mu$, which is well-known to converge
weakly almost surely to $\mu$. We can take as $\Sigma^*_{n+1}$ an independent pick from $\mu$ and include $\Sigma^*_{n+1}$ in
the construction of $(R_{n+1}, \mu_{n+1})$. Specifically, we can obtain the pick from $\mu_n$ for the junction
point $J_n$ as the junction point of the subtree containing $\Sigma^*_{n+1}$. Since $R_n \to \mathcal{T}$ almost surely in
the Hausdorff sense, there is $n_0$ such that all subtrees of $\mathcal{T} \setminus R_{n_0}$ have height less than $\varepsilon/2$, but
then the distance between $\Sigma^*_{n+1}$ and the $(n+1)$st leaf of $R_{n+1}$, which are in the same subtree
by construction, is at most $\varepsilon$ for all $n \ge n_0$, which entails the result by standard arguments. $\square$

Proof of Theorem 21. Let $M$ be a fragmenter with splitting density $f$. By the if part of
Theorem 14 there is a Markovian embedding of $M$ into $\Pi$ for a binary homogeneous fragmentation
process with dislocation measure $\nu(du) = f^*(u)\,du$, where $f^*(u) = u f(u) + (1 - u) f(1 - u)$.
Hence, Lemma 23 applies and gives $(R_n, \mu_n) \to (\mathcal{T}, \mu)$ and $(R_n, \nu_n) \to (\mathcal{T}, \mu)$ almost surely in the
Gromov-Hausdorff-Prohorov sense, for an $\alpha$-self-similar CRT $(\mathcal{T}, \mu)$ with dislocation measure
$f^*(u)\,du$. $\square$

We can now use Theorem 21 and Lemma 23 to complete the proof of Theorem 14.

Proof of the only if part of Theorem 14. Let $f$ be a splitting density. Consider the bead
splitting process $((R_n, \mu_n), n \ge 1)$ based on a sequence $M_n$, $n \ge 1$, of fragmenters with splitting
density $f$. In Theorem 21, we showed that $(R_n, \mu_n) \to (\mathcal{T}, \mu)$ almost surely for a CRT $(\mathcal{T}, \mu)$
with (symmetrised) splitting density $f^*$ associated with $f$.

Now assume that a fragmenter with splitting density $f$ has a Markovian embedding $A$ into
a binary homogeneous fragmentation process $\Pi$ with any symmetric splitting density $\widetilde{f}$. By
Lemma 23, $(R_n, \mu_n) \to (\widetilde{\mathcal{T}}, \widetilde{\mu})$ almost surely for a CRT with (symmetrised) splitting density $\widetilde{f}$.
By uniqueness of limits, $(\mathcal{T}, \mu) = (\widetilde{\mathcal{T}}, \widetilde{\mu})$. Since the distributions of CRTs for different dislocation
measures are different, we find that $f^* = \widetilde{f}$ almost everywhere. $\square$

Remark 24 With the usual names $\Sigma_1, \ldots, \Sigma_n$ of leaves of $R_n$, $n \ge 1$, any bead splitting process
embedded in a CRT $(\mathcal{T}, \mu)$ gives rise to a, typically non-exchangeable, $\mathcal{P}$-valued process

$$\Pi_\alpha(t) = \{\{j \ge 1 : \Sigma_j \in \mathcal{T}^t_i\}, i \ge 1\} \cup \{\{j\}, j \ge 1 : \Sigma_j \notin \mathcal{T}^t\}, \qquad t \ge 0,$$

of the same form as the exchangeable special case $\Pi^*_\alpha$ in (23). Furthermore, if $(R_n, \mu_n) \to (\mathcal{T}, \mu)$
as in Lemma 23, we find equality of the decreasing rearrangements of asymptotic frequencies
$|\Pi_\alpha(t)|^\downarrow = |(\mu(\mathcal{T}^t_i), i \ge 1)|^\downarrow = |\Pi^*_\alpha(t)|^\downarrow$ for all $t \ge 0$ a.s. So it is natural to perform the inverse
of the self-similar time change (22) to construct a, typically non-exchangeable, homogeneous
process $\Pi = \Pi_0$ from the consistently time-changed evolution of its blocks containing $n$, $n \ge 1$.

Our proof of the only if part of Theorem 14 used the size-biased bead splitting process and the
compactness of self-similar CRTs to show that the Markovian path $A$ gives rise to an embedding
that exhausts a CRT. The embedding for the if part of Theorem 14 was not carried out in a
CRT, but directly in an exchangeable homogeneous fragmentation process, and we can rephrase
our bead splitting argument for the only if part here to directly construct a non-exchangeable
process $\Pi$ based on $A$ by embedding into an exchangeable homogeneous fragmentation process
$\Pi^*$, as indicated below. However, this is harder to formulate, and we lose natural compactness,
so we do not attempt an alternative proof, but let us give the direct construction of $\Pi$.

Let $A$ be a Markovian path in $\Pi^*$. Define branch times $J_{1,n} = \inf\{t\ge0: n\notin A_t\}$, $n\ge2$,
between 1 and $n$. Given $J_{i,n}$, $n\ge i+1$, have been constructed for all $i\in[k]$, consider the time
$H_{k+1} = \max\{J_{i,k+1}, i\in[k]\}$ when $k+1$ separates from the last $i\in[k]$. Relabel the restriction of
$(\Pi^*(H_{k+1}+t), t\ge0)$ to the block $B_{k+1}$ of $\Pi^*(H_{k+1})$ that contains $k+1$ by the increasing bijection
$B_{k+1}\to\mathbb N$. Run a copy of $A$ inside this process, relabel back $\mathbb N\to B_{k+1}$ to find a Markovian
path $B^{(k+1)}$ that we specify to coincide with the canonical path $A^{(k+1)}$ of $\Pi^*$ up to $H_{k+1}$ and to
continue in $(\Pi^*(H_{k+1}+t)\cap B_{k+1}, t\ge0)$, as constructed. Define $J_{k+1,n} = \inf\{t\ge0: n\notin B^{(k+1)}_t\}$,
$n\ge k+2$. Finally set $J_{k,k} = \infty$, $J_{k,n} = J_{n,k}$ for $n<k$ and define the embedded $\mathcal P$-valued process

$$\Pi_t = \{\{n\ge1: J_{n,i}>t\},\ i\ge1\},\qquad t\ge0. \qquad (39)$$
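
To make (39) concrete, the following minimal Python sketch reads a partition of $[N]$ off a finite table of branch times. The function name `partition_from_branch_times` and the numerical table `J` are hypothetical and serve only as an illustration; the construction above produces random branch times, which are not simulated here.

```python
import math

def partition_from_branch_times(J, t):
    """Return the blocks {n : J[n][i] > t}, i = 1..N, as in (39).

    J is a symmetric N x N table with J[i][i] = infinity; labels are 1-based
    in the output. The table is assumed consistent (ultrametric), so that the
    sets obtained for different i are either equal or disjoint.
    """
    N = len(J)
    blocks = set()
    for i in range(N):
        blocks.add(frozenset(n + 1 for n in range(N) if J[n][i] > t))
    return sorted(sorted(b) for b in blocks)

INF = math.inf
# Hypothetical branch times for labels 1..4: {1,2} separates from {3,4} at
# time 0.5, then 3 from 4 at time 1.5, then 1 from 2 at time 2.0.
J = [
    [INF, 2.0, 0.5, 0.5],
    [2.0, INF, 0.5, 0.5],
    [0.5, 0.5, INF, 1.5],
    [0.5, 0.5, 1.5, INF],
]

for t in (0.0, 0.5, 1.5, 2.0):
    print(t, partition_from_branch_times(J, t))
```

Because (39) uses the strict inequality $J_{n,i} > t$, two labels are already in different blocks at their branch time itself.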

Corollary 25 Let $A$ be a Markovian path in an exchangeable homogeneous fragmentation process
$\Pi^*$, and let $\Pi$ be as in (39). Then $|\Pi|^\downarrow = |\Pi^*|^\downarrow$. Moreover, if $|A|$ is a fragmenter, then $\Pi$ is
a homogeneous fragmentation process with binary non-exchangeable $\kappa$-measure [27] of the form

$$\kappa(\{\Gamma\in\mathcal P: \Gamma\cap[n] = \{\pi_1,\pi_2\}\}) = \int_0^1 u^{\#\pi_1-1}(1-u)^{\#\pi_2}\,u f(u)\,du,\qquad \{\pi_1,\pi_2\}\in\mathcal P_n\setminus\{\{[n]\}\},$$

where $\mathcal P_n$ is the set of partitions of $[n]$, with $\kappa(\{\Gamma\in\mathcal P\setminus\{\{\mathbb N\}\}: \Gamma_1\cup\Gamma_2\ne\mathbb N\}) = 0$.

Proof. We leave the equivalence of the two constructions of $\Pi$ to the reader and just point
out that the CRT construction of Remark 24 yields $|\Pi|^\downarrow = |\Pi^*|^\downarrow$. For the second claim, we note
that the splitting rule of the fragmenter is $f(u)$, so, by standard thinning properties of the
Poisson point process of jumps of the fragmenter $M$ and size-biased branching, we identify the
dislocation measure. $\square$

Remark 26 It may be observed from the form of the bead splitting process in the case of an 
independent and identically distributed sequence $(M_n, n\ge1)$ that the size-biased bead selection
rule is not crucial for convergence to a CRT since it mainly affects the (random) time $n$ at which
a particular bead is split. In the proof of Lemma 23, the main use of the size-biased selection
rule was to establish (p8]) . Indeed, as long as we split every bead eventually, we are quite free 
to choose the order in which we split the beads and may even contemplate rules like splitting 
all beads of $\mu_n$ at once at every stage of the bead splitting process.

The reader may also want to compare our bead splitting processes with Abraham's [1] 
construction of a version of the Brownian CRT. Let us rephrase Abraham's construction in 
our present framework. Given a Brownian CRT, consider the spine from the root to the highest
leaf, then attach to each spinal vertex the spine to the highest leaf of the corresponding spinal 
subtree, then recursively attach to each subtree of these spines their spines to the respective 
highest leaf. Since the path to the highest leaf is not a Markovian path, and indeed the spinal 
subtrees are not rescaled copies of the Brownian CRT, but copies constrained in height, this 
falls outside the setting of Lemma 23.

Intuitively, the best we can do in our Markovian framework to end up high is to use the 
switching probabilities of Example [8] and always choose the bigger fragment. The homogeneous 
Poissonian structure for relative masses $(F_t, 1-F_t)$ on the spine to the embedded leaf easily
entails that this does not lead to the highest leaf a.s. 

3.3 The Brownian CRT 

Let $(\mathcal T,\mu)$ denote the Brownian Continuum Random Tree $\mathcal T$ equipped with its mass measure
$\mu$, which Aldous [3] constructed both as the tree embedded in (twice the standard) Brownian
excursion, with $\mu$ corresponding to Lebesgue measure on $[0,1]$, and as a limit as $n\to\infty$ of
an increasing sequence of binary subtrees with edge-lengths $R_n$ with $n$ leaves labelled by $[n]$,
in which case $\mu$ may be interpreted as the almost sure weak limit as $n\to\infty$ of the uniform
probability distribution on the $n$ leaves of $R_n$. The tree $R_n$ may be constructed as the subtree
of $\mathcal T$ spanned by $n$ leaves of $\mathcal T$, which given $\mathcal T$ are picked independently according to the mass
measure $\mu$. We recover this second construction in Theorem 21 for $f(u) = \frac{1}{\sqrt{2\pi}}u^{-3/2}(1-u)^{-3/2}$,
enriched by the string of beads structure given to the branches of $R_n$ by measures $\mu_n$. According
to a basic result of Aldous [3], the increasing sequence of lengths $(\lambda(R_1),\lambda(R_2),\ldots)$ of these
subtrees can be constructed as $\lambda(R_n) = \sqrt{2\Gamma_n}$ where $\Gamma_n = \varepsilon_1+\cdots+\varepsilon_n$ for a sequence of
independent standard exponential variables $\varepsilon_i$. For $n=1$ there is the identity in distribution

$$\lambda(R_1) \overset{d}{=} L_1^0(B^{\rm br}) \qquad (40)$$

where $B^{\rm br}$ is a standard Brownian bridge, starting at 0 at time 0 and ending at 0 at time
1, and $(L_t^x(B^{\rm br}), 0\le t\le1, x\in\mathbb R)$ is the jointly continuous process of local times of $B^{\rm br}$,
normalised so that $L_t^x(B^{\rm br})\,dx$ is the occupation measure of $(B^{\rm br}(s), 0\le s\le t)$. The common
distribution of both sides in (40) has the Rayleigh density $x\exp(-x^2/2)$ at $x>0$. The work
of Aldous, Miermont and Pitman [4] yields a deeper connection between the Brownian CRT
in a Brownian excursion on the one hand and Brownian bridge on the other. This establishes
a spinal decomposition result for the Brownian CRT via path transformations rather than via
Bertoin's extended branching property as in [21] or Lemma 22 here. See also Bertoin and Pitman
[HI Theorem 3.2] for an expression in terms of paths rather than trees. To explore this spinal
decomposition, let $(\tau_\ell, 0\le\ell\le L_1^0(B^{\rm br}))$ be the inverse local time process

$$\tau_\ell := \inf\{t\ge0: L_t^0(B^{\rm br}) > \ell\}$$

so that the collection of excursion intervals of $B^{\rm br}$ is exhausted by

$$\{(\tau_{\ell-},\tau_\ell): \ell>0,\ \tau_{\ell-}<\tau_\ell\},$$

and let $P$ be the random discrete distribution obtained by ranking these intervals by length.
On the other hand, in the Brownian CRT $(\mathcal T,\mu)$, for $0\le\ell\le\lambda(R_1)$ let $\mu_1([0,\ell])$ denote the
mass of all points $x\in\mathcal T$ such that the path from root to $x$ in $\mathcal T$ branches from the path from
root to leaf $\Sigma_1$ of $\mathcal T$ at a junction point on $R_1$ whose distance from the root of $\mathcal T$ is at most
$\ell$. Then according to the spinal decomposition of the Brownian CRT implied by [H Lemma 9
and Equation (12)], the equality in distribution (40) extends to the equality in distribution of
processes

$$(\mu_1([0,\ell]), 0\le\ell\le\lambda(R_1)) \overset{d}{=} (\tau_\ell, 0\le\ell\le L_1^0(B^{\rm br})). \qquad (41)$$
Moreover, conditionally given the process on the left side of (41), the Brownian CRT $(\mathcal T,\mu)$
decomposes into a countable collection of subtrees

$$\{\mathcal T_\ell,\ 0<\ell<\lambda(R_1),\ \mu_1(\{\ell\})>0\}$$

where $\mathcal T_\ell$ is a Brownian CRT equipped with a mass measure $\mu_\ell$ with total mass $\mu_\ell(\mathcal T_\ell) =
\mu_1(\{\ell\}) > 0$, in a way which corresponds to the decomposition of the path of $|B^{\rm br}|$, the
absolute value of $B^{\rm br}$, into excursions of lengths $\{\tau_\ell-\tau_{\ell-},\ 0<\ell<L_1^0(B^{\rm br}),\ \tau_{\ell-}<\tau_\ell\}$. Indeed, the
entire Brownian CRT $\mathcal T$ can be constructed from $B^{\rm br}$ so that the equality in distribution (41)
holds almost surely, and for each $\ell$ with $\mu_1(\{\ell\}) = \tau_\ell-\tau_{\ell-} > 0$, the subtree $\mathcal T_\ell$ of $\mathcal T$ attached
to the spine $R_1$ of $\mathcal T$ at distance $\ell$ from the root is constructed from the excursion of $|B^{\rm br}|$
on $(\tau_{\ell-},\tau_\ell)$ in the same way that the entire tree $\mathcal T$ is constructed from a standard Brownian
excursion. In particular, basic properties of Brownian excursions then imply the spinal
decomposition of $\mathcal T$, that conditionally given the subtree masses $(\mu_1(\{\ell\}), 0\le\ell\le\lambda(R_1))$, the subtrees
$(\mathcal T_\ell,\mu_\ell)$ associated with $\ell$ such that $\mu_1(\{\ell\})>0$ form a collection of independent random trees
distributed like $(\sqrt{\mu_1(\{\ell\})}\,\mathcal T, \mu_1(\{\ell\})\mu)$, meaning that all edge-lengths in $\mathcal T$ are scaled by a factor
of $\sqrt{\mu_1(\{\ell\})}$, while all masses are scaled by a factor of $\mu_1(\{\ell\})$.

The distribution of ranked masses of atoms of $\mu_1$ is the PD$(\frac12,\frac12)$ distribution of ranked
lengths of excursions of Brownian bridge, and masses corresponding to these lengths are
distributed along the spine of length $\lambda(R_1)$ in an exchangeable random order. Moreover, the length
$\lambda(R_1)$ is itself a measurable functional of the PD$(\frac12,\frac12)$ random discrete distribution of masses
along the spine, as discussed in [JU 126].
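
For readers who wish to experiment with such mass partitions, here is a minimal Python sketch of an approximate sampler for ranked PD$(\alpha,\theta)$ masses. It relies on the standard stick-breaking description of the size-biased order of PD$(\alpha,\theta)$, with independent factors $W_i\sim{\rm Beta}(1-\alpha,\theta+i\alpha)$, and not on the Brownian bridge construction used in this section. The function name `sample_ranked_pd` and the truncation level `sticks` are assumptions made for illustration; truncation means the output is only an approximation.

```python
import random

def sample_ranked_pd(alpha, theta, sticks=5000, seed=0):
    """Approximate ranked PD(alpha, theta) masses by truncated stick-breaking.

    Returns the sampled masses in decreasing order together with the mass
    left unallocated by the truncation.
    """
    rng = random.Random(seed)
    remaining = 1.0
    masses = []
    for i in range(1, sticks + 1):
        # Size-biased stick-breaking factor W_i ~ Beta(1 - alpha, theta + i * alpha).
        w = rng.betavariate(1.0 - alpha, theta + i * alpha)
        masses.append(remaining * w)
        remaining *= 1.0 - w
    return sorted(masses, reverse=True), remaining

# PD(1/2, 1/2): ranked excursion lengths of a standard Brownian bridge.
masses, leftover = sample_ranked_pd(0.5, 0.5)
print(masses[:5], leftover)
```

The same function with `theta = (2 * n - 1) / 2` gives approximate samples from the PD$(\frac12,(2n-1)/2)$ laws that appear later in this section.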

For $n\ge1$, the bead splitting process $((R_n,\mu_n), n\ge1)$ described in terms of decreasing mass
processes in Definition 18 can be described in terms of Brownian bridges, as follows.

• Start from a segment

$$(R_1,\mu_1) = ([0, L_1^0(B^{\rm br})], d\tau) \qquad (42)$$

where $d\tau$ denotes the Stieltjes measure with cumulative distribution function given in (41),
associated with the inverse local time $\tau$ of a standard Brownian bridge $B^{\rm br}$ of length 1.

• Given $(R_n,\mu_n)$, pick a junction point $J_n$ from $R_n$ according to $\mu_n$. Given that $\mu_n(\{J_n\}) =
m$, remove the mass $m$ from point $J_n$ and attach as segment $(]]J_n,\Sigma_{n+1}]], \mu_{n+1})$ a copy
of (42) derived from a Brownian bridge of length $m$.

Specifically, a Brownian bridge of length $m$ may be constructed from the standard Brownian
bridge $B^{\rm br}$ as $m^{1/2}B^{\rm br}(t/m)$, $0\le t\le m$. That is to say, $R_{n+1}\setminus R_n = ]]J_n,\Sigma_{n+1}]]$ is such that

$$\lambda(R_{n+1}) - \lambda(R_n) = (\mu_n(\{J_n\}))^{1/2} L_1^0(B_n^{\rm br})$$

for some independent and identically distributed sequence of standard Brownian bridges
$B_n^{\rm br}$, $n\ge1$, and given $\mu_n(\{J_n\}) = m$, the mass $m$ should be reallocated with a portion $(\tau_\ell-\tau_{\ell-})m$ placed
at distance $\ell m^{1/2}$ from $J_n$ along the branch of length $\lambda(R_{n+1})-\lambda(R_n)$ from $J_n$ to $\Sigma_{n+1}$, for
each $\ell\in(0, L_1^0(B_n^{\rm br}))$ with $\tau_{\ell-}<\tau_\ell$.

The above prescription specifies the projective sequence of weighted $\mathbb R$-trees $((R_n,\mu_n), n\ge1)$
as in Theorem 21, which is associated with the Brownian CRT as in [19] and as indicated at
the beginning of Section 3. We wish to point out some special properties of this Brownian tree
growth sequence:
Proposition 27 Let $((R_n,\mu_n), n\ge1)$ be the bead splitting process derived from a sequence of
Brownian bridges, as above. Then we have the following description of the law of $(R_n,\mu_n)$.

(i) The sequence $P_n = (P_{n,1}, P_{n,2},\ldots)$ of sizes of ranked atoms of $\mu_n$ has PD$(\frac12,(2n-1)/2)$
distribution.



(ii) The total length $\lambda(R_n)$ can be represented as both $\lambda(R_n) = \sqrt{2\Gamma_n}$ where $\Gamma_n = \varepsilon_1+\cdots+\varepsilon_n$ for
a sequence of independent standard exponential variables $\varepsilon_i$, and as $\lambda(R_n) = S_{1/2}(P_n)/\sqrt2$,
where $S_{1/2}(P_n)$ is the $\frac12$-diversity of $P_n$, which may be recovered from $P_n$ as

$$S_{1/2}(P_n) = \sqrt\pi \lim_{k\to\infty} k\,P_{n,k}^{1/2} \quad\text{a.s.}$$

where $P_{n,k}$ is the $k$th largest $\mu$-measure of the collection of all fringe subtrees of $\mathcal T$ attached
to $R_n$, or again as

$$S_{1/2}(P_n) = \lim_{m\to\infty} m^{-1/2}K_{n,m}$$

where $K_{n,m}$ is the number of junction vertices $J_i$ with $i\le m$ such that $J_i\in R_n$.

(iii) Conditionally given $\lambda(R_n)$, the tree $R_n$ consists of $2n-1$ segments, whose relative lengths,
when listed in order of depth-first search of $R_n$, passing first along $[[0,\Sigma_1]] = R_1$, then along
$]]J_1,\Sigma_2]] = R_2\setminus R_1$, and so on, is independent of $R_n$, with the same Dirichlet distribution
with $2n-1$ parameters equal to 1 as the sequence of $2n-1$ spacings generated by a sequence
of $2n-2$ independent uniform variables on $[0,1]$.

(iv) For $n\ge2$ the combinatorial shape of $R_n$ is equally likely to be any of the $1\times3\times\cdots\times(2n-3)$
possible shapes of binary trees with root and leaves labeled by $[n]$, independently of $\lambda(R_n)$
and of the sequence of relative lengths of segments.

(v) Conditionally given $\lambda(R_n) = \ell_1$, the combinatorial shape of $R_n$ and the sequence of relative
lengths of segments, let $(\sigma_v, 0\le v\le\ell_1)$ be a path which traverses $R_n$ at unit speed, passing
first along $[[0,\Sigma_1]] = R_1$, then along $]]J_1,\Sigma_2]] = R_2\setminus R_1$, and so on, and let $R_{n,\ell}$ be the
range of $(\sigma_v, 0\le v\le\ell)$, so that by construction $\lambda(R_{n,\ell}) = \ell$ for all $0\le\ell\le\lambda(R_n) = \ell_1$.
Then the cumulative mass process $(\mu_n(R_{n,\ell}), 0\le\ell\le\lambda(R_n) \mid \lambda(R_n) = \ell_1)$ has the same
distribution with exchangeable increments as the inverse of the local time process at 0 of a
Brownian bridge $B^{\rm br}$ conditionally on $L_1^0(B^{\rm br}) = \ell_1$.
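
As a rough illustration of parts (ii) to (iv), the following Python sketch samples the total length and the $2n-1$ segment lengths of $R_n$ and counts the possible shapes. It only uses the descriptions stated in the proposition (a sum of standard exponentials, uniform spacings, and the product $1\times3\times\cdots\times(2n-3)$); the function names `num_shapes` and `sample_segment_lengths` are hypothetical, and neither the shape itself nor the bead measure $\mu_n$ is simulated.

```python
import math
import random

def num_shapes(n):
    """1 x 3 x ... x (2n - 3), the number of shapes in part (iv)."""
    return math.prod(range(1, 2 * n - 2, 2))

def sample_segment_lengths(n, rng):
    """Sample lambda(R_n) and the lengths of its 2n - 1 segments.

    lambda(R_n) = sqrt(2 * Gamma_n) with Gamma_n a sum of n standard
    exponentials (part (ii)); the relative lengths are the spacings of
    2n - 2 independent uniforms on [0, 1] (part (iii)).
    """
    total = math.sqrt(2.0 * sum(rng.expovariate(1.0) for _ in range(n)))
    cuts = sorted(rng.random() for _ in range(2 * n - 2))
    points = [0.0] + cuts + [1.0]
    spacings = [b - a for a, b in zip(points, points[1:])]
    return total, [total * s for s in spacings]

rng = random.Random(1)
total, segments = sample_segment_lengths(4, rng)
print(num_shapes(4), total, segments)
```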

Proof. This can largely be read from the preceding discussion and known descriptions of
$(R_n, n\ge1)$ and properties of Poisson-Dirichlet distributions. Partial results appear in many
places, including [3, Section 4.3], [20, Proposition 18] and [28, Proposition 14].

Part (i) is a particular case of crushing a size-biased pick from PD$(\alpha,\theta)$ with PD$(\alpha,1-\alpha)$
into PD$(\alpha,\theta+1)$, as established by [H], which can also be read from Aldous's sequential
description of the growth of $(R_n, n\ge1)$. That description implies part (v) quite easily. To
deduce (ii) from (v), observe that $P_n$ is the sequence of ranked jumps of the cumulative mass
process $(\mu_n(R_{n,\ell}), 0\le\ell\le\lambda(R_n))$ which given $\lambda(R_n) = \ell$ is distributed like the ranked lengths
of excursion intervals of a standard Brownian bridge. As $n$ changes, the distribution of $P_n$ is
therefore obtained from that of $P_1$ by tilting the distribution by the density factor between
the exponential distribution of $\lambda(R_1)^2/2$ and the Gamma$(n,1)$ distribution of $\lambda(R_n)^2/2$. But
this factor is just $(\lambda(R_1)^2/2)^{n-1}/\Gamma(n)$ where $\lambda(R_1)^2/2 = (S_{1/2}(P_1))^2/4$, which is precisely the
density factor between PD$(\frac12,(2n-1)/2)$ and PD$(\frac12,\frac12)$, see [Ml Theorem 3.13].

As another check of the consistency of the two different descriptions of $\lambda(R_n)$, observe that
the description in terms of independent exponential variables gives

$$\mathbb E(\lambda(R_n)^p) = 2^{p/2}\,\mathbb E(\Gamma_n^{p/2}) = 2^{p/2}\,\Gamma(n+p/2)/\Gamma(n) \qquad (43)$$

whereas for $S_{\alpha,\theta}$ the $\alpha$-diversity of a PD$(\alpha,\theta)$ random discrete distribution it is known [26] that

$$\mathbb E(S_{\alpha,\theta}^p) = \frac{\Gamma(\theta/\alpha+p+1)\,\Gamma(\theta+1)}{\Gamma(\theta+p\alpha+1)\,\Gamma(\theta/\alpha+1)} \qquad (44)$$

so in particular

$$\mathbb E\big(S_{1/2,(2n-1)/2}^p\big) = \frac{\Gamma(2n+p)\,\Gamma(n+\frac12)}{\Gamma(n+p/2+\frac12)\,\Gamma(2n)} = \frac{2^p\,\Gamma(n+p/2)}{\Gamma(n)} \qquad (45)$$

where the second equality uses the Gamma duplication formula $\Gamma(2z) = 2^{2z-1}\Gamma(z)\Gamma(z+\frac12)/\Gamma(\frac12)$.

$\square$
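
The agreement between (43) and (45) under $\lambda(R_n) = S_{1/2}(P_n)/\sqrt2$ can also be checked numerically. The short Python sketch below evaluates both sides with log-Gamma functions; the function names are hypothetical, and the script is a sanity check of the displayed formulas rather than part of the proof.

```python
import math

def moment_from_exponentials(n, p):
    """Right-hand side of (43): 2^(p/2) * Gamma(n + p/2) / Gamma(n)."""
    return math.exp(0.5 * p * math.log(2.0)
                    + math.lgamma(n + p / 2.0) - math.lgamma(n))

def diversity_moment(alpha, theta, p):
    """Formula (44) for the p-th moment of the alpha-diversity of PD(alpha, theta)."""
    return math.exp(math.lgamma(theta / alpha + p + 1.0) + math.lgamma(theta + 1.0)
                    - math.lgamma(theta + p * alpha + 1.0)
                    - math.lgamma(theta / alpha + 1.0))

def moment_from_diversity(n, p):
    """E(lambda(R_n)^p) computed from lambda(R_n) = S_{1/2}(P_n) / sqrt(2)."""
    return 2.0 ** (-p / 2.0) * diversity_moment(0.5, (2 * n - 1) / 2.0, p)

for n in (1, 2, 5):
    for p in (0.5, 1.0, 2.0, 3.0):
        print(n, p, moment_from_exponentials(n, p), moment_from_diversity(n, p))
```

For $n=1$ and $p=1$ both expressions reduce to $\sqrt{\pi/2}$, the mean of the Rayleigh density $x\exp(-x^2/2)$.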

A Proof of Lemma 6

First note that the assumptions imply that $\tau$ is exponentially distributed and, furthermore, that
for all nonnegative Borel functions $g$ and all $t\ge0$

$$\mathbb E(g(F_\tau)1_{\{\tau>t\}}) = \mathbb P(\tau>t)\,\mathbb E(g(F_{t+(\tau-t)})\mid\tau>t) = \mathbb P(\tau>t)\,\mathbb E(g(F_\tau)),$$

so that $\tau$ is independent of $F_\tau$. Since $\tau$ is a stopping time, $(F_{\tau+s}, s\ge0)$ is independent of
$\mathcal F_\tau$ and has the same distribution as $(F_s, s\ge0)$. Consider a sequence of independent copies
$(F_s^{(n)}, 0\le s\le\tau_n)$, $n\ge1$, of $(F_s, 0\le s\le\tau)$ and splice them together as

$$(\widetilde F_{\tau_1+\cdots+\tau_{n-1}+s}, \widetilde m_{\tau_1+\cdots+\tau_{n-1}+s}) = (F_s^{(n)}, 1_{\{s=\tau_n\}}),\qquad 0<s\le\tau_n,\ n\ge1.$$

Then $((\widetilde F_t, t\ge0),\widetilde\tau) \overset{d}{=} ((F_t, t\ge0),\tau)$, by construction, where $\widetilde\tau = \tau_1$. Also by construction,
$\widetilde F = (\widetilde F_t, t\ge0)$ and $\widetilde F^* = (\widetilde F_t\widetilde m_t + (1-\widetilde m_t), t\ge0)$ are $\widetilde{\mathcal F}$-Poisson point processes in their
joint natural filtration $\widetilde{\mathcal F}$. The intensity measure of $\widetilde F^*$ is the distribution of $F_\tau$ times the rate
of $\tau$. Since $F_\tau\ne1$ a.s., this intensity measure is absolutely continuous with respect to $\Lambda$ as
otherwise, we could find a Borel set $A$ with $\mathbb P(F_\tau\in A)>0$ and $\Lambda(A)=0$, so the restrictions
of $\widetilde F$ and $\widetilde F^*$ would reveal points of $\widetilde F^*$ that are not points of $\widetilde F$, a contradiction. We denote
the Radon-Nikodym derivative by $K(u,dk)$ and set $p(u) = K(u,\{1\})$, so that $\widetilde F^*$ is a Poisson
point process with intensity measure $p(u)\Lambda(du)$. We note that $p(u)\le1$ for $\Lambda$-a.e. $u\in(0,1)$ as
otherwise restrictions to the Borel set $A = \{u\in(0,1): p(u)>1\}$ would reveal points of $\widetilde F^*$ that
are not points of $\widetilde F$, a contradiction.

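The proof is not complete yet, because we have not yet shown that $((\widetilde F_t,\widetilde m_t), t\ge0)$ is an
$\widetilde{\mathcal F}$-Poisson point process, or equivalently that the unmarked points $\widetilde F^\circ = (\widetilde F_t(1-\widetilde m_t)+\widetilde m_t, t\ge0)$
form an $\widetilde{\mathcal F}$-Poisson point process. We can represent the point process $\widetilde F^\circ$ as a random measure

$$N_t^\circ(A) = \#\{s\le t: \widetilde F_s^\circ\in A\},\qquad t\ge0 \text{ and } A \text{ Borel subset of } (0,1).$$

For $A$ with $\Lambda(A)<\infty$ and $\int_A p(u)\Lambda(du)>0$, we now show that $(N_t^\circ(A), t\ge0)$ is an $\widetilde{\mathcal F}$-Poisson
process. Consider $N_t(A) = \#\{s\le t: \widetilde F_s\in A\}$, which we know is an $\widetilde{\mathcal F}$-Poisson process with
rate $\Lambda(A)$. The time $\widetilde\tau_A = \inf\{t\ge0: \widetilde F_t^*\in A\}$ is an $\widetilde{\mathcal F}$-stopping time, since $\widetilde F^*$ is an $\widetilde{\mathcal F}$-Poisson
point process. Denote by $(T_n, n\ge1)$ the times of the points of $N(A)$, also $\widetilde{\mathcal F}$-stopping times.
Let $A_n = \{\widetilde\tau_A = T_n\}$ and set $q = \mathbb P(\widetilde\tau_A = T_1)$. Then, by the strong Markov property of $N(A)$ at
$T_n$, we find that also for $n\ge2$

$$\mathbb P(A_n) = \mathbb P(\widetilde\tau_A\ne T_1)\,\mathbb P(A_n\mid\widetilde\tau_A>T_1) = (1-q)\,\mathbb P(A_{n-1}) = (1-q)^{n-1}q. \qquad (46)$$

Hence, we find that $\widetilde\tau_A = T_G$ for a $G$ geometric with parameter $q$, which is a stopping time in
the discrete filtration $(\widetilde{\mathcal F}_{T_n}, n\ge1)$. By Wald's equation

$$\mathbb E(\widetilde\tau_A) = \mathbb E(G)\,\mathbb E(T_1) \quad\text{hence}\quad \int_A p(u)\Lambda(du) = q\Lambda(A).$$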
From (46) and the strong Markov property at $\widetilde\tau_A$, we deduce that $(N^*(A), N(A))$ are such that
$N^*(A)$ is a $q$-thinning of $N(A)$, so $N^\circ(A) = N(A) - N^*(A)$ is also a Poisson process with rate

$$(1-q)\Lambda(A) = \int_A (1-p(u))\Lambda(du).$$

By [23, Theorem 12.8], this suffices to identify $N^\circ$ as a Poisson random measure with intensity
measure $(1-p(u))\Lambda(du)$. $\square$
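
The consequence of Wald's equation used above, namely that the first marked point of a $q$-thinned Poisson process of rate $\Lambda(A)$ arrives after mean time $1/(q\Lambda(A))$, can be illustrated by simulation. The Python sketch below assumes independent marking with probability $q$, which is what the proof derives rather than assumes; the function names and parameter values are hypothetical.

```python
import random

def first_marked_time(rate, q, rng):
    """Time of the first marked point of a Poisson process of the given rate,
    when each point is marked independently with probability q."""
    t = 0.0
    while True:
        t += rng.expovariate(rate)  # next inter-arrival time of N(A)
        if rng.random() < q:        # the point is marked
            return t

def mean_first_marked_time(rate, q, samples, seed=0):
    """Monte Carlo estimate of the mean time to the first marked point."""
    rng = random.Random(seed)
    return sum(first_marked_time(rate, q, rng) for _ in range(samples)) / samples

rate, q = 2.0, 0.25
estimate = mean_first_marked_time(rate, q, samples=20000)
print(estimate, 1.0 / (q * rate))
```

With these values $\mathbb E(G) = 1/q = 4$ and $\mathbb E(T_1) = 1/\Lambda(A) = 0.5$, so the estimate should be close to 2.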